Co-modulators of nuclear receptors

ABSTRACT

A testing system for identifying effectors of polypeptides (ARAP3 variants) having the respective amino acid sequences represented in Seq. ID Nos. 14 to 18 and 20. In this test system a reporter gene is expressed in a cell transfected with a nucleic acid or in a cell transfected with a vector containing at least one copy of this nucleic acid; the cell, if it contains no nuclear receptor or only a small amount thereof, is also transfected with a vector containing DNA of the nuclear receptor; the cell is cultured in the presence of a plurality of test substances and a change in expression of the reporter gene is measured to find the effectors. The nucleic acid codes one of the polypeptides (ARAP3 variants), or has a nucleotide sequence represented in one of Seq. ID Nos. 1 to 13 and 19, or has a nucleotide sequence hybridized with one of Seq. ID Nos. 1 to 13 and 19 and/or one of the nucleic acid sequences coding polypeptides with an amino acid sequence represented in one of Seq. ID Nos. 14 to 18 and 20 under stringent conditions or has a degenerated nucleotide sequence corresponding to one of the foregoing nucleotide sequences within the framework of the degeneration of the genetic code.

CROSS-REFERENCE

[0001] This disclosure contains subject matter in common with U.S.provisional application, Ser. No. 60/311,699, filed Aug. 10, 2001.

REFERENCE TO SEQUENCE LISTING TABLES

[0002] Sequence listing tables are appended hereinbelow. First sequencelisting tables list sequences for nucleic acids of the inventiondesignated as Seq. ID Nos. 1 to 13 and 19. Second or following sequencelisting tables list the amino acid sequences of polypeptides of theinvention designated Seq. ID Nos. 14 to 18 and 20.

BACKGROUND OF THE INVENTION

[0003] 1. Field of the Invention

[0004] The invention relates to co-modulators of nuclear receptors andthe use thereof for preparing novel drugs.

[0005] 2. Description of the Related Art

[0006] The super-family of nuclear receptors, which includes about 50different proteins, consists of a group of related transcriptionfactors, which control the transcription of a particular target gene asa function of certain specific ligands. On the basis of certaincriteria, for example dimerization status, type of ligand or structureof the DNA binding element this family can be subdivided into severalsubfamilies (Beato et al., 2000, Human Reproduct. Update 6, 225-236). Acharacteristic feature of nuclear receptors is the matching structure offunctional domains (marked A to F) consisting of a highly variable, onlyslightly preserved N-terminal region with autonomous constitutiveactivation function (AF-1), a well preserved DNA-binding domain (DBD),which is responsible for recognition of special DNA-binding elements andconsists of two zinc finger motifs, a variable hinge domain and apreserved multifunctional C-terminal ligand-binding domain (LBD)consisting of a dimerization-dependent and ligand-dependenttransactivation function (AF-2). This is followed by the region locatedat the most remote C-terminal whose function is not known and which isabsent in receptors such as, for example, PR (progesterone receptor),PPAR (peroxisome proliferator-activated receptor) and RXR (retinoid Xreceptor) (Mangelsdorf & Evans, 1995, Cell 83, 841-850; Robyr et al.,2000, Mol. Endocrinol. 14, 329-347). It was demonstrated for somenuclear receptors (for example AR) that the N-terminal region is able tointeract with the C-terminal region (Brinkmann et al., 1999, J. SteroidBiochem. and Mol. Biol. 69, 307-313). Steroid hormone receptors such as,for example, estrogen receptors (ER), progesterone receptors (PR),glucocorticoid receptors (GR), mineralocorticoid receptors (MR) andandrogen receptors (AR) bind steroid ligands, such as the progestins,estrogens, glucocorticoids, mineralocorticoids and androgens all ofwhich are derived from pregnenolone. The binding of the ligands to NRactivates the receptor and controls the expression of the correspondingtarget genes.

[0007] Moreover, another class of proteins, known as co-modulators, hasbeen identified. These proteins play an important role either in theactivation (co-activators) or in the repression (co-repressors) of genetranscription as bridging molecules between the transcription initiationcomplex and the nuclear receptors (McKenna et al., 1999, Endocr. Rev.20, 321-347). A co-activator enhances the receptor function and in thepresence of an agonist—but not in the presence of anantagonist—interacts directly with the activation domains of nuclearreceptors. It also interacts with the basal transcription apparatus, butdoes not spontaneously enhance the basal transcription activity. Mostco-modulators interact with the AF-2 domain of nuclear receptors withthe aid of one or more LXXLL motifs (NR boxes) in the protein sequence.Several co-modulators, however, which can interact with other aminoacids with nuclear receptors (Ding et al. 1998, Mol. Endocrinol. 12,302-313) have been described. Moreover, many co-modulators, which insimilar manner interact with several different nuclear receptors havebeen identified.

[0008] It was shown for an estrogen receptor co-activator, known asA1B1, that it is expressed to a higher degree in cell lines of breastcancer and ovarian cancer and presumably plays an important role in thedevelopment of steroid hormone-dependent tumors (Anzick et al. 1997,Science 277, 965-968).

SUMMARY OF THE INVENTION

[0009] Besides influencing steroid hormone receptors directly withhormones or antihormones, modulating the interaction of co-activatorswith the steroid hormone receptors could be another approach to therapyfor hormone-dependent diseases. The problem, therefore, is to find a newco-activator whereby novel drugs can be provided.

[0010] This problem was solved by preparing nucleic acids consisting of

[0011] a) nucleic acids that code for polypeptides that have the aminoacid sequences represented in Sequence ID Nos. 14 to 18 and 20,

[0012] b) nucleic acids that have the nucleotide sequences representedin Sequence ID Nos. 1 to 13 and 19,

[0013] c) nucleic acids with nucleotide sequences hybridizing with oneof the sequences from a) and/or b) under stringent conditions and whichcode for polypeptides with the biological activity of a co-activator, or

[0014] d) nucleic acids with nucleotide sequences corresponding to thesequences from a), b) or c) within the framework of degeneration of thegenetic code.

[0015] Fragments of these sequences have been published in the GenBankdatabank under Accession Nos. AB037801, AK027280 and AK024991. Nofunctions were assigned to these fragments.

[0016] The concept “hybridization under stringent conditions” accordingto the present invention is defined by Sambrook et al. (MolecularCloning. A Laboratory Manual, Cold Spring Harbor Laboratory Press,1989). Stringent hybridization is present, for example, when after1-hour washing with 1×SSC and 0.1% SDS at 50° C., preferably at 55° C.,particularly at 62° C. and most preferably at 68° C., preferably for onehour in 0.2×SSC and 0.1% SDS at 55° C., particularly at 62° C. and mostpreferably at 68° C., a hybridization signal is still observed (SSCmeans standard saline citrate solution; SDS means sodiumdodecylsulfate). The nucleic acids, which under these conditionshybridize with the nucleic acids represented in Sequence ID Nos. 1 to 13and 19, or with one of the degenerate sequences within the framework ofdegeneration of the genetic code, are also an object of the presentinvention.

[0017] The nucleic acids can be a single-stranded or double-strandedDNA, for example cDNA, or an RNA, for example mRNA, cRNA or pre-mRNA.

[0018] The invention also concerns polypeptides coded by nucleic acidsof the invention or having the amino acid sequences represented inSequence ID Nos. 14 to 18 and 20. In the following, these polypeptidesof the invention are referred to as ARAP3 variants or, briefly, asARAP3. These nucleic acid variants are expressed by the use ofalternative promoters and by alternative processing or splicing of thepre-mRNA. ARAP3 variants contain a zinc finger domain. They showhomologies to other members of the zinc finger domain family such as,for example, hairless protein (Cachon-Gonzales et al., 1994, Proc. Natl.Acad. Sci. 91, 7717-7721) and testis-specific protein a (Hoog et al.,Mol. Reprod. Dev. 30,173-181). Moreover, the C-terminal amino acids havesignificant homologies to the jmjC domain of the jumonji family(Balciunas and Ronne, 2000, Trends Biochem. Sci. 25, 274-76). ARAP3 alsocontains a nucleus localization domain. ARAP3 binds to the androgenreceptor. The binding site is located in the region of amino acids 325to 919 of the androgen receptor (see FIG. 2, AR2 fragment). The bindingsites for the polypeptides of the invention are located as shown in thefollowing table: TABLE I AMINO ACID REGIONS OF THE DOMAINS OF ARAP3POLYPEPTIDES Androgen receptor Nucleus Seq. Zinc Binding Localization IDNo. Finger AS Domain Domain (NLD) JmjC-Domain 14 1664-1692 1732-18412183-2193 2199-2298 15 1858-1886 1926-2035 2377-2385 2393-2492 161846-1874 1914-2023 2365-2375 2381-2480 17 1627-1655 1695-1804 2146-21542162-2261 18 390-418 458-567 909-917 925-953 part. 20 1361-13891430-1539 1881-1898 1897-1996

[0019] ARAP3 has the function of a co-modulator of nuclear receptors,particularly steroid hormone receptors. Examples of such receptors arethe androgen receptor, estrogen receptor α, estrogen receptor β,progesterone receptor A, progesterone receptor B, glucocorticoidreceptor, mineralocorticoid receptor, thyroid hormone receptor, vitaminD receptor and peroxisome proliferator-activated receptor. ARAP3 bindsparticularly well to the androgen receptor thereby enhancing thereceptor function. The receptor function can be modulated, i.e.strengthened or weakened, by the binding.

[0020] In healthy humans, ARAP3 is expressed especially strongly in theheart, liver, testicles and ovaries.

[0021] Another object of the invention are vectors containing at leastone copy of a nucleic acid of the invention. The vectors can beprokaryotic or eukaryotic vectors. Examples of vectors are pPRO(Clontech), pBAD (Invitrogen), pSG5 (Stratagene), PCI (Promega), pIRES(Clontech), pBAC (Clontech), pMET (Invitrogen) and pBlueBac(Invitrogen). By methods known to those skilled in the art, the nucleicacids of the invention can be inserted into these vectors. The nucleicacids of the invention are preferentially linked to the expressionsignals such as, for example, promotor and enhancer on the vector.

[0022] The invention also relates to cells transfected with nucleic acidsequences of the invention or with a vector of the invention. For,example, E. coli, yeast, Pichia, Sf9, COS, CV-1 or BHK can be used asthe cells. These cells can be employed for the production of thepolypeptides of the invention or for testing systems.

[0023] The polypeptides of the invention or partial regions thereof(peptides) can be used for the preparation of antibodies. To producepolyclonal antibodies, the polypeptides or peptides, for example KLH(keyhole limpet hemocyanin) can be linked and injected into animals, forexample rabbits. They can also be used for preparing monoclonalantibodies. To prepare the antibodies, a polypeptide or peptide of theinvention or a mixture of several peptides of the invention can be used.The antibodies are prepared by standard procedures, as described, forexample, by Kohler, G. and Milstein, C., Nature 1975, 265, 495-497, andby Nelson, P.N. et al., Mol. Pathol. 2000; 53, 111-117.

[0024] Yet another object of the invention are antibodies directedagainst the polypeptides of the invention.

[0025] The antibodies of the invention can be used for detecting ARAP3and its variants. This can be done, for example, byimmunohistochemistry. The antibodies of the invention can also be usedin other immune tests, for example ELISA (enzyme linked immunosorbentassay) or in radjoimmune tests. In this manner, the concentration ofARAP3 can be determined in tissue extracts or cell extracts.

[0026] Detection of expression of the polypeptides of the invention canalso be accomplished through the detection of mRNA in the cells. Hence,an object of the invention is also the use of a probe with nucleic acidsequences that are complementary to the nucleic acid sequences codingfor ARAP3, for preparing a reagent for detecting the presence of themRNA of the invention in cells. A probe is a short piece of DNA with atleast 14 nucleotides. The probes of the invention can be used, forexample, in a Northern blot analysis. This method is described, forexample, in Sambrook, J., et al., 1989, Cold Spring Harbor LaboratoryPress. Other methods for detecting RNA are in-site hybridization, RNAseprotection assay or polymer chain reaction (PCR).

[0027] The detection or identification of ARAP3 expression is usedprimarily in androgen-dependent diseases. For example, elevated androgenactivation in diseases, such as prostate cancer and benign prostatetumors and in acne or loss of hair, can be attributed to increasedco-activator activity of ARAP3. On the other hand, a reducedco-activator activity can be present in hypogonadism, erectiledysfunction and androgen insensitive syndromes, for example thetesticular feminization syndrome. In these diseases, the balance ofandrogen receptor and co-activator can be disturbed. Hence, it isadvantageous to determine, besides the amount of expressed ARAP3, alsothe amount of expressed androgen receptor in the same tissue. Theandrogen receptor protein can also be determined by immunologicalmethods, for example by radioimmunoassay, ELISA or Western blot.

[0028] Another object of the invention is the use of ARAP3 or of thenucleic acids that code for it or its variants as target substance forpreparing an agent for treating steroid hormone-dependent diseases. Suchsteroid hormone-dependent diseases also include besides the above-saidandrogen-dependent diseases, for example, estrogen-dependent diseasessuch as breast cancer, and osteoporosis or cardiovascular diseases andvascular diseases. ARAP3 could also be used as target substance for thepreparation of drugs to influence male fertility.

[0029] In particular, the invention comprises the use of

[0030] a) a nucleic acid of the invention

[0031] b) a polypeptide of the invention or

[0032] c) a cell of the invention

[0033] for the purpose of identifying effectors of ARAP3.

[0034] Effectors are substances with an inhibitory or activating effecton ARAP3 and which are capable of influencing the co-activator functionof ARAP3. Preferred are substances, which modulate the interaction ofARAP3 with the androgen receptor. This can be tested, for example, bymeasuring the binding of the purified ARAP3 polypeptide to the androgenreceptor polypeptide in the presence of the test substance and comparingthe result with the control value obtained without the test substance.The binding can be determined with the aid of a marker bound to ARAP3 orto the androgen receptor. Such markers can be, for example, fluorescencemarkers, biotin or radioactive markers.

[0035] Another possibility of identifying effectors consists of using acell containing the cDNA of ARAP3 or of a functional part thereof, thecDNA of the androgen receptor or some other nuclear receptor and thecDNA of a reporter gene.

[0036] The reporter gene used can be, for example, luciferase. Theactivity of luciferase in this case reflects the activity of the nuclearreceptor. The cells are incubated in the presence of the test substance,and the luciferase activity is determined. The incubation can also becarried out in the additional presence of the ligand of the nuclearreceptor. For example, antagonistic effects can be measured in thismanner. The use of the androgen receptor and its ligand, an androgen, ispreferred.

[0037] The effectors of ARAP3 can be used for the treatment of steroidhormone-dependent diseases. In diseases involving, for example, strongandrogen activation, an inhibitor of the interaction of ARAP3 and theandrogen receptor can be administered, and in diseases that involvereduced co-activator activity, a stimulator of this interaction can beadministered.

[0038] Diseases involving a deficiency of ARAP3 can be treated also bymodifying the ARAP3 concentration in the affected tissues. To this end,into the tissue is introduced either a nucleic acid of the inventionwith the aid of a vector used in gene therapy, or a polypeptide of theinvention. In gene therapy, a vector containing a nucleic acid of theinvention is constructed and administered. Examples are vectors derivedfrom adenovirus, adenovirus-associated virus, herpes simplex virus orSV40. The gene therapy can be carried out by a protocol such as thatdescribed by Gomez-Navarro, J. et al. (Eur. J. Cancer 1999, 35,867-885). Administration can be local, namely directly into the affectedtissue, for example the tumor, or systemically, namely via the bloodcirculation. This results in increased expression of ARAP3.

[0039] ARAP3 can be administered in the form of a fusion polypeptide.With the aid of the fused polypeptide, for example EGF [epidermal growthfactor] or transferrin, the polypeptide of the invention is transportedpreferentially into the desired tissue, for example the tumor tissue.

[0040] Diseases can also be due to excessive expression of ARAP3. Inthis case, the nuclear receptors are so highly sensitized that they canbe activated not only by their ligands but also by other substances,which show no effect under physiological conditions. In this case, it isdesirable to reduce the expression of ARAP3. This can be done withantisense molecules. These molecules are complementary to the nucleicacid sequences represented in Seq. ID Nos. 1 to 13 and 19, or partsthereof.

[0041] Yet another object of the invention is a method for preparing apharmaceutical agent, wherein

[0042] a) substances are brought in contact with a testing system of theinvention,

[0043] b) the action of the substances on the testing system is measuredby comparison with a control,

[0044] c) a substance which in step b) showed modulation of ARAP3activity is identified, and

[0045] d) the substance identified in step c) is mixed with formulationmaterials commonly used for pharmaceuticals.

[0046] The activity of ARAP3, measured in step b), is represented by theenhancement of the receptor function of the nuclear receptor used.Preferred are steroid hormone receptors, the androgen receptor beingparticularly preferred.

[0047] The testing systems of the invention can also be used to testenvironmental samples. Many substances in the environment occur in suchlow concentrations that they exert an effect only on human steroidhormone receptors, which express the corresponding co-activator inincreased amounts. By use of a testing system of the invention, thesesubstances can be identified. A genetic predisposition to the action ofthese substances can be determined by ARAP3 detection according to theinvention.

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWING

[0048] The objects, features and advantages of the invention will now beillustrated in more detail with the aid of the following examples, withreference to the accompanying figures in which:

[0049]FIG. 1 is a schematic representation of the androgen receptor, inwhich AF stands for activation function, DBD for the DNA binding domain,LBD for the ligand-binding domain and AS for amino acids and which showsthe fragment of the androgen receptor (AR2), which was used for thetwo-hybrid screen;

[0050]FIG. 2 is a schematic representation of the ARAP3 polypeptide Seq.ID No. 20, which shows the region binding to the androgen receptor andthe zinc finger domain;

[0051]FIG. 3 shows the tissue distribution of ARAP3 in a Northern blotanalysis in which 2 μg of human poly A+RNA was separated on a gel,transferred to a membrane and hybridized with a ARAP3-cDNA fragment (kbstands for kilobases); and

[0052] FIG. 4 shows the result of a mammal hybrid test with the ARAP3fragment AS 1430 to 1539 of Seq. ID No. 20 and the human androgenreceptor, which was performed as described in Example 2 below and whichemployed an analogous VP16 construct of an SRC1 fragment binding to theandrogen receptor as positive control.

EXAMPLES Example 1 Two-Hybrid Screen

[0053] By use of a cDNA library from fetal brain (Clontech MATCHMAKER)and a human AR fragment coding for the amino acids 325 to 919 as theprobe (FIG. 1), a screening was performed by means of the yeast-2-hybridsystem in the presence and in the absence of 10⁻⁶ mol ofdihydroxytestosterone (DHT). In accordance with the producer'sinstructions (Clontech), the number of screened clones was 3×10 ⁶ and2×10⁷. According to the producer's information, the number ofindependent clones was 3.5×10⁶. From these, we selected 800 positiveclones and tested them by a β-galactosidase assay, which confirmed 240as lacZ-positive clones. The inserts of these clones were amplified byPCR. By means of restriction fragment analysis and sequencing, at least17 different clones were identified. One of these was a clone with aninsert comprising 327 base pairs coding for part of the ORF (openreading frame) of KIAA1380 (gene bank access number AB03780 1). Inscreening the library with and without 10⁻⁶ mol of DHT, this clone wasidentified forty times.

[0054] With the aid of PCR techniques, ARAP3-cDNA was then lengthened.Different splicing variants were found for them, which represent thenucleic acids having the nucleic acid sequences Seq. ID Nos. 1 to 13.From these five polypeptides were derived with the amino acid sequencesSeq. ID Nos. 14 to 18.

Example 2 Mammal Hybrid Test

[0055] The binding of the ARAP3 fragment (AS 1430-1539) to the androgenreceptor (AR) was confirmed by the mammal hybrid test (FIG. 4). TheARAP3 fragment as fusion protein was cloned with VP16 (CMX VP16-ARAP3-III). Human PC3 cells were transfected with CMX-VP1 6-ARAP3-III,the expression vector that contains the complete AR, (pSG5AR) and withthe reporter gene, luciferase, which is under the control of theAR-dependent MMTV promoter (pMMTV-luc). After 24 hours, the cells wereincubated with dihydroxytestosterone (DHT). After an additional day, thecells were lysed, and the activity of the reported gene luciferase wasdetermined. The results were normalized by means of the protein contentof the preparations, determined in parallel. The control used was apreparation with the empty expression vector CMX-VP1 6-empty.

Example 3 Determination of the Co-Activator Activity of ARAP3

[0056] An eukaryotic expression vector, for example pCMX which containsARAP3-cDNA that codes for the complete ARAP3 protein or for a functionalpart thereof (PCMX-ARAP3), was transfected in suitable cell lines, forexample SH-SY5Y or PC3, together with pSG5AR and pMMTV-luc. The controlwas a preparation with the empty expression vector pCMX. The androgenactivity can be determined from the activity of the reporter luciferaseas in the foregoing example. The effect of ARAP3 on the androgenicsignal can be determined by comparison with the activity of the controlpreparation.

Example 4 Determination of the Binding of ARAP3 to the Androgen Receptorby Means of a Pull-Down Experiment

[0057] A GST pull-down experiment is characterized by an experimentalprocedure, which allows binding of an in vitro expressed GST fusionprotein to a similar in vitro expressed and radioactively labeledprotein, subsequent separation from non-interacting proteins anddetection and identification of the bound protein as a measure of thebinding. The experiments were performed according to the protocol asdescribed in Sambrook and Russel, Molecular Cloning, Volume 3, Chapter18, Procedure 3 (p. 18.55 ff); Cold Spring Harbor Laboratory Press, NewYork, 2001.

[0058] For the expression an ARAP3 fragment (AS 1735 to 1840 of Seq. IDNo. 14) was cloned as a GST fusion protein in the vector pGEX-KG(Pharmacia) and was over-expressed and prepared in bacterial cellsaccording to the procedure of the manufacturer. The ARAP3 fusion proteinwas bound to glutathione sepharose spherical particles and incubatedwith a 35S-methionine-labeled androgen receptor (pSG5AR with T7 TNTreticulozyte lysate of Promega). Subsequently this was washed withcentrifuging. The bound portion of the androgen receptor to theGST-ARAP3 spherical particles was determined by SDS-PAGE and subsequentautofluorography. The binding ratio could be formed in a similartreatment of non-fusioned GST spherical particles relative to theGST-ARAP3 spherical particles in an experimental start by comparison ofthe blackening of the androgen receptor protein band on the hyperfilm(TM Amersham) and densitometry analysis by means of density integration.Validated binding ratio values for GST-ARAP3 to GST-empty resulted.These binding ratios are tabulated in Table II below and shown with theamounts of protein used as determined by Western Blot. These bindingratios are regarded as detection or proof of the binding of ARAP3 to theandrogen receptor. TABLE II Protein, Western Binding, Pull-down RatioGST- GST- GST- GST-ARAP3/ Exp. empty In ARAP3 GST-empty ARAP3 GST-emptyNo. Vol./μl In Vol./μl In Vol./μl In Vol./μl Ratio 1 445 220 160 209426.5 2 833 351 10.6 ≈5000 1122 3 4614 208 <1 257 >>100 4 349 382 <1 225>>100

[0059] The disclosure in German Patent Application 101 35 787.7 of Jul.23, 2001 is incorporated here by reference. This German PatentApplication describes the invention described hereinabove and claimed inthe claims appended hereinbelow and provides the basis for a claim ofpriority for the instant invention under 35 U.S.C. 119.

[0060] While the invention has been illustrated and described asembodied in compositions for transferring active compounds in acell-specific manner, it is not intended to be limited to the detailsshown, since various modifications and changes may be made withoutdeparting in any way from the spirit of the present invention.

[0061] Without further analysis, the foregoing will so fully reveal thegist of the present invention that others can, by applying currentknowledge, readily adapt it for various applications without omittingfeatures that, from the standpoint of prior art, fairly constituteessential characteristics of the generic or specific aspects of thisinvention.

[0062] What is claimed is new and is set forth in the following appendedclaims.

1 20 1 8734 DNA Homo sapiens 1 cacaaggctg cgaatctggg agtccaaaggagtgtttgcg gggagggccc gaggtcgctg 60 tgtccggggc agagcggccg gttcgtcccgagtctgcgct ctttcgggtc cgctgctgtg 120 tcccggtccg ctctcctcag catgagcggccggtaggagt gaggtttcat cggttccctc 180 gcactggagg aggcagcggc cgcttcggcagcgacagcta tggcggtaga gacgcgggca 240 gagctggtgg gtaagcggtt cctgtgtgtggcggtcggcg acgaggcacg ttcggagcgc 300 tgggagagcg gacgcggctg gcgaagctggcgagcggggg tcatccgagc cgtgtcacac 360 agggacagcc gcaatccgga cctggcgatgattcaaataa gtggaaatgg aaactcttca 420 ggtgtgtatg tggaatttga tgatcttgaatgggataaac gagagtgggt taaagtttat 480 gaagattttt caactttctt ggtggaataccacttaatct gggccaaaag gaatgaccct 540 agccagactc agggatcaaa gagcaaacagattcagtggc ctgcattgtt ctgggataca 600 tatgcggaac gtgccggttt gttacatagactttcaaacc tctggttgaa agaaatatac 660 ccagttcagt cactgcagta gaattccttgtagataagca actggatttt ttaactgaag 720 atagtgcctt tcagccctac caggacgacatagacagcct aaacccagtt ctcagggaca 780 acccgcagct tcatgaggaa gtgaaagtctgggtaaagga acaaaaggtt caggagattt 840 ttatgcaagg tccttattcc ttaaatggatacagagtgag agtatataga caagactctg 900 ccacccagtg gtttactggc ataattactcatcatgatct cttcacccgc accatgatcg 960 ttatgaatga tcaggtacta gaaccacagaatgtcgatcc ttctatggtt caaatgacct 1020 ttctagatga tgttgttcac tctttgttaaaaggtgaaaa tattggcatt acatcacgac 1080 gcaggtctcg tgccaatcaa aacgtcaacgctgttcacag ccattataca cgtgcccaag 1140 caaatagtcc cagaccagca atgaactcccaagctgctgt accaaaacag aatacacacc 1200 agcaacagca acaaagaagt atccgtccaaataagaggaa gggctcagat agcagtatac 1260 cagatgaaga gaagatgaag gaggaaaaatatgattatat atcacgagga gaaaatccta 1320 aaggtaaaaa caaacacttg atgaataaaagaaggaaacc tgaggaggat gaaaagaaac 1380 taaatatgaa aagacttcga actgacaatgtttcagactt ttctgagagc agtgactcag 1440 aaaattcaaa taagagaata atagataattcctcagaaca gaagccagag aatgaattga 1500 aaaataaaaa tacttcaaaa ataaatggagaagaaggaaa accccataat aatgagaagg 1560 caggagaaga gaccctaaaa aatagccagcctccctggga tcaaatacag gaagataaaa 1620 aacatgaaga agcagagaag cggaagtctgttgacactca gcttcaagaa gatatgatta 1680 ttcattcgtc agaacagtcc acagtttctgatcataattc taatgattta cttcctcagg 1740 aatgcaatat ggataaaaca cataccatggaattgctacc aaaggagaag tttgtatcca 1800 gaccacccac accaaaatgt gttattgatattacaaatga cactaattta gaaaaggtgg 1860 ctcaggaaaa ctcaagtacc tttggccttcagacacttca gaaaatggat cctaatgtta 1920 gtgattcaaa acactctatt gcaaatgcaaaattcttgga aacagcaaaa aaagattctg 1980 accagagctg ggtcagtgat gtagttaaagtggatctaac ccaatcaagt gttacaaatg 2040 cttcttcagg aaatgatcac ttgaacatggaaaaagagaa gtatgtctct tacatttctc 2100 ctttaagtgc agtttctgtc atggaagataagctgcataa gcgaagtcca cctccagaga 2160 ctataaaatc taaacttaat acttcagtagatactcacaa gataaaatcc agcccatcac 2220 ctgaagttgt taaacccaaa ataactcattctcctgattc tgtaaagtct aaggccactt 2280 atgtgaacag ccaagctact ggtgaaagaagattggcaaa taagatagaa catgagctat 2340 caagatgcag ttttcatcca attcctactcgaagcagtac attagaaact acaaagagtc 2400 ctcttatcat tgataaaaat gagcattttacagtttacag agatcctgca cttattgggt 2460 cagaaacagg agctaatcat atttcacctttcctaagcca gcatcctttt cctcttcact 2520 cctcatctca tagaacctgt ttaaatccaggtacccatca tcctgcctta actcctgcac 2580 cccatttact agccggatca tctagtcaaactccattacc taccattaac actcatcctc 2640 tgactagtgg tccacaccat gctgttcatcaccctcattt acttcccact gtgttacctg 2700 gagtgcctac tgcctcctta cttggtggccacccacgact agagagtgct catgccagca 2760 gcttgagcca cttagcgcta gcacaccagcaacaacaaca gttgttacag caccagtcac 2820 ctcatcttct tggacaagcc catccttctgcttcatataa tcagcttgga ctttatccaa 2880 ttatttggca gtatccaaat ggaacacatgcatactcagg acttggtttg ccttcttcta 2940 agtgggttca cccagaaaat gcagttaatgctgaagcttc attaaggagg aattctccca 3000 gtccttggct acatcagccc acccctgtgacctcagcaga tggtattgga ttacttagtc 3060 acattcctgt cagaccttcc agtgcagagcctcatcggcc tcttaaaatt acagcccatt 3120 ccagtccacc attgacaaaa actttagtagatcatcataa ggaagaatta gaaagaaaag 3180 cttttatgga accattacgg tctgttgcatccacatcagc caaaaatgac ctggatctaa 3240 ataggtcaca gactggaaaa gattgtcacttacataggca ttttgtggat ccagtattaa 3300 atcagttaca gaggccaccc caggagactggagagaggtt aaacaaatac aaagaggaac 3360 accgtcgaat tcttcaagaa agtattgatgttgctccctt tacaactaaa atcaagggac 3420 ttgagggtga gagagagaat tattccagagtggcatcatc atcttccagt cctaaaagcc 3480 atatcatcaa acaagatatg gatgtagaacgctcagtatc agatctttat aaaatgaagc 3540 actcagtgcc tcagagttta ccccaaagtaactatttcac tacattgtct aatagtgtgg 3600 tcaatgaacc accaagatca tacccatccaaagaagtttc aaatatttac ggtgataaac 3660 agagtaatgc ccttgcagcg gcagcagctaatcctcaaac tctgacttca tttataacat 3720 ctctttcaaa gcctccacct ttgattaaacaccaaccaga aagtgaaggt ttagtaggca 3780 agataccaga acatcttcca catcagattgcatctcactc agtaacaacc ttcagaaatg 3840 attgtaggag tcctacccat ttgacagtttcttctacaaa tacactccgc agtatgcctg 3900 cattacatag agcaccagta tttcacccaccaatccatca cagcctggaa agaaaggaag 3960 gcagctatag tagtctttcc cctccaactttaactccggt gatgccagta aatgctggtg 4020 gtaaagttca agaatcacag aagcctccaactctaatacc cgaaccaaaa gactcccagg 4080 caaattttaa gagttcttca gaacagagtttgacggagat gtggagacct aataataacc 4140 tcagcaaaga gaaaactgaa tggcatgtggagaaaagcag cggaaagtta caggctgcta 4200 tggcatctgt cattgtgcgt ccatcttctagtacaaaaac tgatagtatg ccagcaatgc 4260 agttagcttc taaagatcga gttagtgaaagatcttcagc tggggcacat aaaacagatt 4320 gcctcaaact agcagaagcc ggagaaactggaagaatcat tttgccaaat gtgaattcag 4380 acagtgttca cacaaaatct gaaaaaaactttcaggctgt ctcacagggc agtgttccca 4440 gttcagtcat gtctgctgta aatacgatgtgtaataccaa aacggatgta atcacatctg 4500 ctgccgatac taccagtgtt tccagctggggtggttcaga agtaatttcc tctttatcaa 4560 ataccatttt ggcctctaca tcatcagaatgtgtatcttc aaaaagtgtc agtcagccag 4620 tggctcaaaa acaagaatgc aaggtcagcaccacagcacc agttacatta gccagtagta 4680 agacaggaag tgttgttcaa cccagttctgggttctcagg cacaactgat tttatccatt 4740 taaaaaagca caaggcagca ttggctgcagctcagtataa aagtagtaat gccagtgaga 4800 ctgaacctaa tgctataaaa aatcagacactttcagcctc ccttcctctg gatagcactg 4860 taatctgtag tacaattaac aaagcaaactctgtaggaaa tgggcaagct tcccagacaa 4920 gtcaaccaaa ctaccatact aaactgaaaaaggcctggct caccagacac tcagaagaag 4980 ataaaaatac taataaaatg gaaaattcagggaattctgt atcagaaatt attaagccat 5040 gttctgtcaa cttaatagcc tctacatctagtgatataca aaatagtgta gatagtaaga 5100 tcatagttga taaatatgta aaagatgataaagtcaacag gagaaaagcc aaaagaactt 5160 atgaatctgg ctctgaaagt ggagactcagatgaaagtga aagcaagtca gagcaaagga 5220 ctaaaagaca acctaagcca acttacaaaaagaagcaaaa tgatttgcaa aagagaaaag 5280 gtgaaataga agaagatttg aaacccaatggagttctcag caggagtgcc aaagaaagaa 5340 gtaaactgaa gttgcaaagc aacagtaatactggcattcc tcgttcagta ttgaaagatt 5400 ggcgtaaagt caagaagctg aagcaaactggggaatcctt tttacaggat gactcctgct 5460 gtgagatagg gcctaattta caaaagtgtcgagaatgtag acttattcgc agtaaaaaag 5520 gagaagaacc agctcactca ccagtattttgtagatttta ctactttaga cggttgtcat 5580 ttagtaaaaa cggagtagtt agaatagatggtttctcttc tcctgaccaa tatgatgatg 5640 aagctatgag tttgtggaca catgaaaattttgaagatga tgaactagat atagagactt 5700 ctaaatatat cttggatata ataggtgataagttctgtca attagtaaca tctgaaaaaa 5760 cagctttgtc ctgggtgaaa aaggatgccaaaattgcctg gaaaagagca gtgagaggag 5820 tccgggagat gtgtgatgca tgtgaagcaacattgtttaa cattcactgg gtctgccaaa 5880 aatgtggatt tgtggtctgc ttagattgttacaaggcaaa ggaaaggaag agttctagag 5940 ataaagaact atatgcttgg atgaagtgtgtgaagggaca gcctcatgat cacaaacatt 6000 taatgccaac ccaaattata cctggttctgttttgacaga tcttctagat gccatgcaca 6060 ctcttaggga aaaatatggt attaaatcccattgtcattg tactaacaaa cagaatttac 6120 aagttggaaa ttttcctaca atgaatggtgtatctcaagt tttacagaat gttcttaatc 6180 acagtaataa aatttctctg tgcatgcctgagtctcagca gcaaaatact cctccgaagt 6240 ctgagaaaaa tggtggcagc agcccagagagtgatgtagg cacagataac aagttaactc 6300 ctccagaatc ccagtcacca ctgcactggttagcagatct tgcagagcaa aaagccagag 6360 aggaaaaaaa agaaaacaaa gaacttacccttgaaaacca aattaaagaa gaaagagaac 6420 aagacaactc tgaatctcca aatggcagaacatcacctct tgtgtcccag aataatgaac 6480 aaggctcaac cttacgggat ttgctgactacaacagctgg aaagctacgt gtggggtcta 6540 cagatgctgg cattgccttt gccccagtatattcaatggg agccccaagt agcaaaagtg 6600 gacggactat gcctaacatt cttgatgacataattgcttc agttgttgaa aacaaaattc 6660 caccaagtaa aacctccaag ataaatgtaaaaccagagct taaagaagag cctgaagaaa 6720 gcataatatc tgcagtggat gaaaataataaattatacag tgatatacca cattcttgga 6780 tctgtgagaa gcatatttta tggcttaaggattataagaa tagcagtaat tggaagcttt 6840 tcaaagaatg ttggaaacaa ggacagcctgcagtggtttc tggtgtgcat aagaaaatga 6900 acattagcct atggaaggcg gaatcaattagtcttgattt tggagaccac caagctgatc 6960 tcctgaactg caaagatagc atcatttcaaatgccaatgt taaggaattc tgggatggtt 7020 ttgaagaagt ttcaaaacgg cagaaaaacaagagtggaga aacagttgtt ttaaaattga 7080 aagactggcc ttcaggagaa gacttcaagactatgatgcc agcaagatac gaagatcttt 7140 taaaaagtct gccattgcca gaatattgtaatccagaagg aaaattcaat ttggcctctc 7200 atttgccagg attttttgta cgtcctgatctaggacccag gttgtgcagt gcctatggtg 7260 tagttgctgc taaagatcat gatataggaacaacaaatct ccatattgaa gtttctgatg 7320 ttgtaaatat actagtttat gttggcatagcaaaaggaaa tggcattctc tcaaaagcag 7380 gaattctcaa gaaatttgag gaagaagatttggatgacat tttaaggaaa agattgaagg 7440 actcaagtga aatacctggt gctctgtggcatatttatgc tgggaaagat gttgacaaga 7500 taagggaatt tcttcaaaag atttcaaaagaacaaggcct tgaagttcta ccagaacatg 7560 atccaatacg tgaccaaagt tggtatgtgaacaaaaagct ccgtcaaagg ctgcttgaag 7620 aatatggagt cagaacctgt actcttattcagttccttgg tgatgctatt gttttgccag 7680 cgggagcact tcatcaggtt cagaattttcacagctgtat tcaggtaact gaagattttg 7740 tgtctccaga acatcttgta gagtcatttcatttaacaca ggaactgaga cttttgaagg 7800 aagaaatcaa ttatgatgat aaactacaggttaaaaatat tttgtatcat gcagtcaaag 7860 aaatggtgag agccttgaag atacacgaggatgaagtaga ggatatggaa gaaaattaag 7920 tgtgatccag tttgatattt ttaggttgttgaactgggat tacttaacct tgaatgatga 7980 tatgtatgca cactgacttt aagcttcataaaaccatcag tgccaagaaa ttctctttgt 8040 agtaattact tgttactgac accacagcagtatagcatat gtcacagctc ctgtgattca 8100 atgttataaa acaagcagaa ttttaaaagcagcactatat agctgttttg tattatagtg 8160 tatatgatgt ttgtgaaaat gccagatttaaaatgatgta tttatttttg gtaaaaaata 8220 aaaaattcta tgctatattg ttgatcaagtgtaaatgtga ccttgtacag tttactaaaa 8280 ttactgatat ttttcactac attgagacagttactgtgag aataggacac aaacaccagc 8340 tattgcctgc atctgggaaa ttgctgaatcgcacagcagt catgtcataa tcagaaaatt 8400 actgccaaat aattgtaaaa tttgtaaagtataaagtata taaagtagat actaaataca 8460 gacacttcaa tattttgttg aagctattgactgtacaatt aaacattttc aaaaggtgta 8520 atttatttaa aattgtctca ttttggtaaaatttatgtga acttttaaag ctaaatatta 8580 aacttaatat gctatgtaaa tatatacatatatacattta atgatgtatt tttttaaaac 8640 attggcttgc ttttgttaaa gtgcaagtgttacatatggc tttgtacatt aaagttgaaa 8700 ggggttttac attttccatt aaaaggactttatc 8734 2 8693 DNA Homo sapiens 2 cacaaggctg cgaatctggg agtccaaaggagtgtttgcg gggagggccc gaggtcgctg 60 tgtccggggc agagcggccg gttcgtcccgagtctgcgct ctttcgggtc cgctgctgtg 120 tcccggtccg ctctcctcag catgagcggccggtaggagt gaggtttcat cggttccctc 180 gcactggagg aggcagcggc cgcttcggcagcgacagcta tggcggtaga gacgcgggca 240 gagctggtgg gtaagcggtt cctgtgtgtggcggtcggcg acgaggcacg ttcggagcgc 300 tgggagagcg gacgcggctg gcgaagctggcgagcggggg tcatccgagc cgtgtcacac 360 agggacagcc gcaatccgga cctggcgatgattcaaataa gtggaaatgg aaactcttca 420 ggtgtgtatg tggaatttga tgatcttgaatgggataaac gagagtgggt taaagtttat 480 gaagattttt caactttctt ggtggaataccacttaatct gggccaaaag gaatgaccct 540 agccagactc agggatcaaa gagcaaacagattcagtggc ctgcattgac tttcaaacct 600 ctggttgaaa gaaatatacc cagttcagtcactgcagtag aattccttgt agataagcaa 660 ctggattttt taactgaaga tagtgcctttcagccctacc aggacgacat agacagccta 720 aacccagttc tcagggacaa cccgcagcttcatgaggaag tgaaagtctg ggtaaaggaa 780 caaaaggttc aggagatttt tatgcaaggtccttattcct taaatggata cagagtgaga 840 gtatatagac aagactctgc cacccagtggtttactggca taattactca tcatgatctc 900 ttcacccgca ccatgatcgt tatgaatgatcaggtactag aaccacagaa tgtcgatcct 960 tctatggttc aaatgacctt tctagatgatgttgttcact ctttgttaaa aggtgaaaat 1020 attggcatta catcacgacg caggtctcgtgccaatcaaa acgtcaacgc tgttcacagc 1080 cattatacac gtgcccaagc aaatagtcccagaccagcaa tgaactccca agctgctgta 1140 ccaaaacaga atacacacca gcaacagcaacaaagaagta tccgtccaaa taagaggaag 1200 ggctcagata gcagtatacc agatgaagagaagatgaagg aggaaaaata tgattatata 1260 tcacgaggag aaaatcctaa aggtaaaaacaaacacttga tgaataaaag aaggaaacct 1320 gaggaggatg aaaagaaact aaatatgaaaagacttcgaa ctgacaatgt ttcagacttt 1380 tctgagagca gtgactcaga aaattcaaataagagaataa tagataattc ctcagaacag 1440 aagccagaga atgaattgaa aaataaaaatacttcaaaaa taaatggaga agaaggaaaa 1500 ccccataata atgagaaggc aggagaagagaccctaaaaa atagccagcc tccctgggat 1560 caaatacagg aagataaaaa acatgaagaagcagagaagc ggaagtctgt tgacactcag 1620 cttcaagaag atatgattat tcattcgtcagaacagtcca cagtttctga tcataattct 1680 aatgatttac ttcctcagga atgcaatatggataaaacac ataccatgga attgctacca 1740 aaggagaagt ttgtatccag accacccacaccaaaatgtg ttattgatat tacaaatgac 1800 actaatttag aaaaggtggc tcaggaaaactcaagtacct ttggccttca gacacttcag 1860 aaaatggatc ctaatgttag tgattcaaaacactctattg caaatgcaaa attcttggaa 1920 acagcaaaaa aagattctga ccagagctgggtcagtgatg tagttaaagt ggatctaacc 1980 caatcaagtg ttacaaatgc ttcttcaggaaatgatcact tgaacatgga aaaagagaag 2040 tatgtctctt acatttctcc tttaagtgcagtttctgtca tggaagataa gctgcataag 2100 cgaagtccac ctccagagac tataaaatctaaacttaata cttcagtaga tactcacaag 2160 ataaaatcca gcccatcacc tgaagttgttaaacccaaaa taactcattc tcctgattct 2220 gtaaagtcta aggccactta tgtgaacagccaagctactg gtgaaagaag attggcaaat 2280 aagatagaac atgagctatc aagatgcagttttcatccaa ttcctactcg aagcagtaca 2340 ttagaaacta caaagagtcc tcttatcattgataaaaatg agcattttac agtttacaga 2400 gatcctgcac ttattgggtc agaaacaggagctaatcata tttcaccttt cctaagccag 2460 catccttttc ctcttcactc ctcatctcatagaacctgtt taaatccagg tacccatcat 2520 cctgccttaa ctcctgcacc ccatttactagccggatcat ctagtcaaac tccattacct 2580 accattaaca ctcatcctct gactagtggtccacaccatg ctgttcatca ccctcattta 2640 cttcccactg tgttacctgg agtgcctactgcctccttac ttggtggcca cccacgacta 2700 gagagtgctc atgccagcag cttgagccacttagcgctag cacaccagca acaacaacag 2760 ttgttacagc accagtcacc tcatcttcttggacaagccc atccttctgc ttcatataat 2820 cagcttggac tttatccaat tatttggcagtatccaaatg gaacacatgc atactcagga 2880 cttggtttgc cttcttctaa gtgggttcacccagaaaatg cagttaatgc tgaagcttca 2940 ttaaggagga attctcccag tccttggctacatcagccca cccctgtgac ctcagcagat 3000 ggtattggat tacttagtca cattcctgtcagaccttcca gtgcagagcc tcatcggcct 3060 cttaaaatta cagcccattc cagtccaccattgacaaaaa ctttagtaga tcatcataag 3120 gaagaattag aaagaaaagc ttttatggaaccattacggt ctgttgcatc cacatcagcc 3180 aaaaatgacc tggatctaaa taggtcacagactggaaaag attgtcactt acataggcat 3240 tttgtggatc cagtattaaa tcagttacagaggccacccc aggagactgg agagaggtta 3300 aacaaataca aagaggaaca ccgtcgaattcttcaagaaa gtattgatgt tgctcccttt 3360 acaactaaaa tcaagggact tgagggtgagagagagaatt attccagagt ggcatcatca 3420 tcttccagtc ctaaaagcca tatcatcaaacaagatatgg atgtagaacg ctcagtatca 3480 gatctttata aaatgaagca ctcagtgcctcagagtttac cccaaagtaa ctatttcact 3540 acattgtcta atagtgtggt caatgaaccaccaagatcat acccatccaa agaagtttca 3600 aatatttacg gtgataaaca gagtaatgcccttgcagcgg cagcagctaa tcctcaaact 3660 ctgacttcat ttataacatc tctttcaaagcctccacctt tgattaaaca ccaaccagaa 3720 agtgaaggtt tagtaggcaa gataccagaacatcttccac atcagattgc atctcactca 3780 gtaacaacct tcagaaatga ttgtaggagtcctacccatt tgacagtttc ttctacaaat 3840 acactccgca gtatgcctgc attacatagagcaccagtat ttcacccacc aatccatcac 3900 agcctggaaa gaaaggaagg cagctatagtagtctttccc ctccaacttt aactccggtg 3960 atgccagtaa atgctggtgg taaagttcaagaatcacaga agcctccaac tctaataccc 4020 gaaccaaaag actcccaggc aaattttaagagttcttcag aacagagttt gacggagatg 4080 tggagaccta ataataacct cagcaaagagaaaactgaat ggcatgtgga gaaaagcagc 4140 ggaaagttac aggctgctat ggcatctgtcattgtgcgtc catcttctag tacaaaaact 4200 gatagtatgc cagcaatgca gttagcttctaaagatcgag ttagtgaaag atcttcagct 4260 ggggcacata aaacagattg cctcaaactagcagaagccg gagaaactgg aagaatcatt 4320 ttgccaaatg tgaattcaga cagtgttcacacaaaatctg aaaaaaactt tcaggctgtc 4380 tcacagggca gtgttcccag ttcagtcatgtctgctgtaa atacgatgtg taataccaaa 4440 acggatgtaa tcacatctgc tgccgatactaccagtgttt ccagctgggg tggttcagaa 4500 gtaatttcct ctttatcaaa taccattttggcctctacat catcagaatg tgtatcttca 4560 aaaagtgtca gtcagccagt ggctcaaaaacaagaatgca aggtcagcac cacagcacca 4620 gttacattag ccagtagtaa gacaggaagtgttgttcaac ccagttctgg gttctcaggc 4680 acaactgatt ttatccattt aaaaaagcacaaggcagcat tggctgcagc tcagtataaa 4740 agtagtaatg ccagtgagac tgaacctaatgctataaaaa atcagacact ttcagcctcc 4800 cttcctctgg atagcactgt aatctgtagtacaattaaca aagcaaactc tgtaggaaat 4860 gggcaagctt cccagacaag tcaaccaaactaccatacta aactgaaaaa ggcctggctc 4920 accagacact cagaagaaga taaaaatactaataaaatgg aaaattcagg gaattctgta 4980 tcagaaatta ttaagccatg ttctgtcaacttaatagcct ctacatctag tgatatacaa 5040 aatagtgtag atagtaagat catagttgataaatatgtaa aagatgataa agtcaacagg 5100 agaaaagcca aaagaactta tgaatctggctctgaaagtg gagactcaga tgaaagtgaa 5160 agcaagtcag agcaaaggac taaaagacaacctaagccaa cttacaaaaa gaagcaaaat 5220 gatttgcaaa agagaaaagg tgaaatagaagaagatttga aacccaatgg agttctcagc 5280 aggagtgcca aagaaagaag taaactgaagttgcaaagca acagtaatac tggcattcct 5340 cgttcagtat tgaaagattg gcgtaaagtcaagaagctga agcaaactgg ggaatccttt 5400 ttacaggatg actcctgctg tgagatagggcctaatttac aaaagtgtcg agaatgtaga 5460 cttattcgca gtaaaaaagg agaagaaccagctcactcac cagtattttg tagattttac 5520 tactttagac ggttgtcatt tagtaaaaacggagtagtta gaatagatgg tttctcttct 5580 cctgaccaat atgatgatga agctatgagtttgtggacac atgaaaattt tgaagatgat 5640 gaactagata tagagacttc taaatatatcttggatataa taggtgataa gttctgtcaa 5700 ttagtaacat ctgaaaaaac agctttgtcctgggtgaaaa aggatgccaa aattgcctgg 5760 aaaagagcag tgagaggagt ccgggagatgtgtgatgcat gtgaagcaac attgtttaac 5820 attcactggg tctgccaaaa atgtggatttgtggtctgct tagattgtta caaggcaaag 5880 gaaaggaaga gttctagaga taaagaactatatgcttgga tgaagtgtgt gaagggacag 5940 cctcatgatc acaaacattt aatgccaacccaaattatac ctggttctgt tttgacagat 6000 cttctagatg ccatgcacac tcttagggaaaaatatggta ttaaatccca ttgtcattgt 6060 actaacaaac agaatttaca agttggaaattttcctacaa tgaatggtgt atctcaagtt 6120 ttacagaatg ttcttaatca cagtaataaaatttctctgt gcatgcctga gtctcagcag 6180 caaaatactc ctccgaagtc tgagaaaaatggtggcagca gcccagagag tgatgtaggc 6240 acagataaca agttaactcc tccagaatcccagtcaccac tgcactggtt agcagatctt 6300 gcagagcaaa aagccagaga ggaaaaaaaagaaaacaaag aacttaccct tgaaaaccaa 6360 attaaagaag aaagagaaca agacaactctgaatctccaa atggcagaac atcacctctt 6420 gtgtcccaga ataatgaaca aggctcaaccttacgggatt tgctgactac aacagctgga 6480 aagctacgtg tggggtctac agatgctggcattgcctttg ccccagtata ttcaatggga 6540 gccccaagta gcaaaagtgg acggactatgcctaacattc ttgatgacat aattgcttca 6600 gttgttgaaa acaaaattcc accaagtaaaacctccaaga taaatgtaaa accagagctt 6660 aaagaagagc ctgaagaaag cataatatctgcagtggatg aaaataataa attatacagt 6720 gatataccac attcttggat ctgtgagaagcatattttat ggcttaagga ttataagaat 6780 agcagtaatt ggaagctttt caaagaatgttggaaacaag gacagcctgc agtggtttct 6840 ggtgtgcata agaaaatgaa cattagcctatggaaggcgg aatcaattag tcttgatttt 6900 ggagaccacc aagctgatct cctgaactgcaaagatagca tcatttcaaa tgccaatgtt 6960 aaggaattct gggatggttt tgaagaagtttcaaaacggc agaaaaacaa gagtggagaa 7020 acagttgttt taaaattgaa agactggccttcaggagaag acttcaagac tatgatgcca 7080 gcaagatacg aagatctttt aaaaagtctgccattgccag aatattgtaa tccagaagga 7140 aaattcaatt tggcctctca tttgccaggattttttgtac gtcctgatct aggacccagg 7200 ttgtgcagtg cctatggtgt agttgctgctaaagatcatg atataggaac aacaaatctc 7260 catattgaag tttctgatgt tgtaaatatactagtttatg ttggcatagc aaaaggaaat 7320 ggcattctct caaaagcagg aattctcaagaaatttgagg aagaagattt ggatgacatt 7380 ttaaggaaaa gattgaagga ctcaagtgaaatacctggtg ctctgtggca tatttatgct 7440 gggaaagatg ttgacaagat aagggaatttcttcaaaaga tttcaaaaga acaaggcctt 7500 gaagttctac cagaacatga tccaatacgtgaccaaagtt ggtatgtgaa caaaaagctc 7560 cgtcaaaggc tgcttgaaga atatggagtcagaacctgta ctcttattca gttccttggt 7620 gatgctattg ttttgccagc gggagcacttcatcaggttc agaattttca cagctgtatt 7680 caggtaactg aagattttgt gtctccagaacatcttgtag agtcatttca tttaacacag 7740 gaactgagac ttttgaagga agaaatcaattatgatgata aactacaggt taaaaatatt 7800 ttgtatcatg cagtcaaaga aatggtgagagccttgaaga tacacgagga tgaagtagag 7860 gatatggaag aaaattaagt gtgatccagtttgatatttt taggttgttg aactgggatt 7920 acttaacctt gaatgatgat atgtatgcacactgacttta agcttcataa aaccatcagt 7980 gccaagaaat tctctttgta gtaattacttgttactgaca ccacagcagt atagcatatg 8040 tcacagctcc tgtgattcaa tgttataaaacaagcagaat tttaaaagca gcactatata 8100 gctgttttgt attatagtgt atatgatgtttgtgaaaatg ccagatttaa aatgatgtat 8160 ttatttttgg taaaaaataa aaaattctatgctatattgt tgatcaagtg taaatgtgac 8220 cttgtacagt ttactaaaat tactgatatttttcactaca ttgagacagt tactgtgaga 8280 ataggacaca aacaccagct attgcctgcatctgggaaat tgctgaatcg cacagcagtc 8340 atgtcataat cagaaaatta ctgccaaataattgtaaaat ttgtaaagta taaagtatat 8400 aaagtagata ctaaatacag acacttcaatattttgttga agctattgac tgtacaatta 8460 aacattttca aaaggtgtaa tttatttaaaattgtctcat tttggtaaaa tttatgtgaa 8520 cttttaaagc taaatattaa acttaatatgctatgtaaat atatacatat atacatttaa 8580 tgatgtattt ttttaaaaca ttggcttgcttttgttaaag tgcaagtgtt acatatggct 8640 ttgtacatta aagttgaaag gggttttacattttccatta aaaggacttt atc 8693 3 8656 DNA Homo sapiens 3 cacaaggctgcgaatctggg agtccaaagg agtgtttgcg gggagggccc gaggtcgctg 60 tgtccggggcagagcggccg gttcgtcccg agtctgcgct ctttcgggtc cgctgctgtg 120 tcccggtccgctctcctcag catgagcggc cggaggcagc ggccgcttcg gcagcgacag 180 ctatggcggtagagacgcgg gcagagctgg tgggtaagcg gttcctgtgt gtggcggtcg 240 gcgacgaggcacgttcggag cgctgggaga gcggacgcgg ctggcgaagc tggcgagcgg 300 gggtcatccgagccgtgtca cacagggaca gccgcaatcc ggacctggcg atgattcaaa 360 taagtggaaatggaaactct tcaggtgtgt atgtggaatt tgatgatctt gaatgggata 420 aacgagagtgggttaaagtt tatgaagatt tttcaacttt cttggtggaa taccacttaa 480 tctgggccaaaaggaatgac cctagccaga ctcagggatc aaagagcaaa cagattcagt 540 ggcctgcattgactttcaaa cctctggttg aaagaaatat acccagttca gtcactgcag 600 tagaattccttgtagataag caactggatt ttttaactga agatagtgcc tttcagccct 660 accaggacgacatagacagc ctaaacccag ttctcaggga caacccgcag cttcatgagg 720 aagtgaaagtctgggtaaag gaacaaaagg ttcaggagat ttttatgcaa ggtccttatt 780 ccttaaatggatacagagtg agagtatata gacaagactc tgccacccag tggtttactg 840 gcataattactcatcatgat ctcttcaccc gcaccatgat cgttatgaat gatcaggtac 900 tagaaccacagaatgtcgat ccttctatgg ttcaaatgac ctttctagat gatgttgttc 960 actctttgttaaaaggtgaa aatattggca ttacatcacg acgcaggtct cgtgccaatc 1020 aaaacgtcaacgctgttcac agccattata cacgtgccca agcaaatagt cccagaccag 1080 caatgaactcccaagctgct gtaccaaaac agaatacaca ccagcaacag caacaaagaa 1140 gtatccgtccaaataagagg aagggctcag atagcagtat accagatgaa gagaagatga 1200 aggaggaaaaatatgattat atatcacgag gagaaaatcc taaaggtaaa aacaaacact 1260 tgatgaataaaagaaggaaa cctgaggagg atgaaaagaa actaaatatg aaaagacttc 1320 gaactgacaatgtttcagac ttttctgaga gcagtgactc agaaaattca aataagagaa 1380 taatagataattcctcagaa cagaagccag agaatgaatt gaaaaataaa aatacttcaa 1440 aaataaatggagaagaagga aaaccccata ataatgagaa ggcaggagaa gagaccctaa 1500 aaaatagccagcctccctgg gatcaaatac aggaagataa aaaacatgaa gaagcagaga 1560 agcggaagtctgttgacact cagcttcaag aagatatgat tattcattcg tcagaacagt 1620 ccacagtttctgatcataat tctaatgatt tacttcctca ggaatgcaat atggataaaa 1680 cacataccatggaattgcta ccaaaggaga agtttgtatc cagaccaccc acaccaaaat 1740 gtgttattgatattacaaat gacactaatt tagaaaaggt ggctcaggaa aactcaagta 1800 cctttggccttcagacactt cagaaaatgg atcctaatgt tagtgattca aaacactcta 1860 ttgcaaatgcaaaattcttg gaaacagcaa aaaaagattc tgaccagagc tgggtcagtg 1920 atgtagttaaagtggatcta acccaatcaa gtgttacaaa tgcttcttca ggaaatgatc 1980 acttgaacatggaaaaagag aagtatgtct cttacatttc tcctttaagt gcagtttctg 2040 tcatggaagataagctgcat aagcgaagtc cacctccaga gactataaaa tctaaactta 2100 atacttcagtagatactcac aagataaaat ccagcccatc acctgaagtt gttaaaccca 2160 aaataactcattctcctgat tctgtaaagt ctaaggccac ttatgtgaac agccaagcta 2220 ctggtgaaagaagattggca aataagatag aacatgagct atcaagatgc agttttcatc 2280 caattcctactcgaagcagt acattagaaa ctacaaagag tcctcttatc attgataaaa 2340 atgagcattttacagtttac agagatcctg cacttattgg gtcagaaaca ggagctaatc 2400 atatttcacctttcctaagc cagcatcctt ttcctcttca ctcctcatct catagaacct 2460 gtttaaatccaggtacccat catcctgcct taactcctgc accccattta ctagccggat 2520 catctagtcaaactccatta cctaccatta acactcatcc tctgactagt ggtccacacc 2580 atgctgttcatcaccctcat ttacttccca ctgtgttacc tggagtgcct actgcctcct 2640 tacttggtggccacccacga ctagagagtg ctcatgccag cagcttgagc cacttagcgc 2700 tagcacaccagcaacaacaa cagttgttac agcaccagtc acctcatctt cttggacaag 2760 cccatccttctgcttcatat aatcagcttg gactttatcc aattatttgg cagtatccaa 2820 atggaacacatgcatactca ggacttggtt tgccttcttc taagtgggtt cacccagaaa 2880 atgcagttaatgctgaagct tcattaagga ggaattctcc cagtccttgg ctacatcagc 2940 ccacccctgtgacctcagca gatggtattg gattacttag tcacattcct gtcagacctt 3000 ccagtgcagagcctcatcgg cctcttaaaa ttacagccca ttccagtcca ccattgacaa 3060 aaactttagtagatcatcat aaggaagaat tagaaagaaa agcttttatg gaaccattac 3120 ggtctgttgcatccacatca gccaaaaatg acctggatct aaataggtca cagactggaa 3180 aagattgtcacttacatagg cattttgtgg atccagtatt aaatcagtta cagaggccac 3240 cccaggagactggagagagg ttaaacaaat acaaagagga acaccgtcga attcttcaag 3300 aaagtattgatgttgctccc tttacaacta aaatcaaggg acttgagggt gagagagaga 3360 attattccagagtggcatca tcatcttcca gtcctaaaag ccatatcatc aaacaagata 3420 tggatgtagaacgctcagta tcagatcttt ataaaatgaa gcactcagtg cctcagagtt 3480 taccccaaagtaactatttc actacattgt ctaatagtgt ggtcaatgaa ccaccaagat 3540 catacccatccaaagaagtt tcaaatattt acggtgataa acagagtaat gcccttgcag 3600 cggcagcagctaatcctcaa actctgactt catttataac atctctttca aagcctccac 3660 ctttgattaaacaccaacca gaaagtgaag gtttagtagg caagatacca gaacatcttc 3720 cacatcagattgcatctcac tcagtaacaa ccttcagaaa tgattgtagg agtcctaccc 3780 atttgacagtttcttctaca aatacactcc gcagtatgcc tgcattacat agagcaccag 3840 tatttcacccaccaatccat cacagcctgg aaagaaagga aggcagctat agtagtcttt 3900 cccctccaactttaactccg gtgatgccag taaatgctgg tggtaaagtt caagaatcac 3960 agaagcctccaactctaata cccgaaccaa aagactccca ggcaaatttt aagagttctt 4020 cagaacagagtttgacggag atgtggagac ctaataataa cctcagcaaa gagaaaactg 4080 aatggcatgtggagaaaagc agcggaaagt tacaggctgc tatggcatct gtcattgtgc 4140 gtccatcttctagtacaaaa actgatagta tgccagcaat gcagttagct tctaaagatc 4200 gagttagtgaaagatcttca gctggggcac ataaaacaga ttgcctcaaa ctagcagaag 4260 ccggagaaactggaagaatc attttgccaa atgtgaattc agacagtgtt cacacaaaat 4320 ctgaaaaaaactttcaggct gtctcacagg gcagtgttcc cagttcagtc atgtctgctg 4380 taaatacgatgtgtaatacc aaaacggatg taatcacatc tgctgccgat actaccagtg 4440 tttccagctggggtggttca gaagtaattt cctctttatc aaataccatt ttggcctcta 4500 catcatcagaatgtgtatct tcaaaaagtg tcagtcagcc agtggctcaa aaacaagaat 4560 gcaaggtcagcaccacagca ccagttacat tagccagtag taagacagga agtgttgttc 4620 aacccagttctgggttctca ggcacaactg attttatcca tttaaaaaag cacaaggcag 4680 cattggctgcagctcagtat aaaagtagta atgccagtga gactgaacct aatgctataa 4740 aaaatcagacactttcagcc tcccttcctc tggatagcac tgtaatctgt agtacaatta 4800 acaaagcaaactctgtagga aatgggcaag cttcccagac aagtcaacca aactaccata 4860 ctaaactgaaaaaggcctgg ctcaccagac actcagaaga agataaaaat actaataaaa 4920 tggaaaattcagggaattct gtatcagaaa ttattaagcc atgttctgtc aacttaatag 4980 cctctacatctagtgatata caaaatagtg tagatagtaa gatcatagtt gataaatatg 5040 taaaagatgataaagtcaac aggagaaaag ccaaaagaac ttatgaatct ggctctgaaa 5100 gtggagactcagatgaaagt gaaagcaagt cagagcaaag gactaaaaga caacctaagc 5160 caacttacaaaaagaagcaa aatgatttgc aaaagagaaa aggtgaaata gaagaagatt 5220 tgaaacccaatggagttctc agcaggagtg ccaaagaaag aagtaaactg aagttgcaaa 5280 gcaacagtaatactggcatt cctcgttcag tattgaaaga ttggcgtaaa gtcaagaagc 5340 tgaagcaaactggggaatcc tttttacagg atgactcctg ctgtgagata gggcctaatt 5400 tacaaaagtgtcgagaatgt agacttattc gcagtaaaaa aggagaagaa ccagctcact 5460 caccagtattttgtagattt tactacttta gacggttgtc atttagtaaa aacggagtag 5520 ttagaatagatggtttctct tctcctgacc aatatgatga tgaagctatg agtttgtgga 5580 cacatgaaaattttgaagat gatgaactag atatagagac ttctaaatat atcttggata 5640 taataggtgataagttctgt caattagtaa catctgaaaa aacagctttg tcctgggtga 5700 aaaaggatgccaaaattgcc tggaaaagag cagtgagagg agtccgggag atgtgtgatg 5760 catgtgaagcaacattgttt aacattcact gggtctgcca aaaatgtgga tttgtggtct 5820 gcttagattgttacaaggca aaggaaagga agagttctag agataaagaa ctatatgctt 5880 ggatgaagtgtgtgaaggga cagcctcatg atcacaaaca tttaatgcca acccaaatta 5940 tacctggttctgttttgaca gatcttctag atgccatgca cactcttagg gaaaaatatg 6000 gtattaaatcccattgtcat tgtactaaca aacagaattt acaagttgga aattttccta 6060 caatgaatggtgtatctcaa gttttacaga atgttcttaa tcacagtaat aaaatttctc 6120 tgtgcatgcctgagtctcag cagcaaaata ctcctccgaa gtctgagaaa aatggtggca 6180 gcagcccagagagtgatgta ggcacagata acaagttaac tcctccagaa tcccagtcac 6240 cactgcactggttagcagat cttgcagagc aaaaagccag agaggaaaaa aaagaaaaca 6300 aagaacttacccttgaaaac caaattaaag aagaaagaga acaagacaac tctgaatctc 6360 caaatggcagaacatcacct cttgtgtccc agaataatga acaaggctca accttacggg 6420 atttgctgactacaacagct ggaaagctac gtgtggggtc tacagatgct ggcattgcct 6480 ttgccccagtatattcaatg ggagccccaa gtagcaaaag tggacggact atgcctaaca 6540 ttcttgatgacataattgct tcagttgttg aaaacaaaat tccaccaagt aaaacctcca 6600 agataaatgtaaaaccagag cttaaagaag agcctgaaga aagcataata tctgcagtgg 6660 atgaaaataataaattatac agtgatatac cacattcttg gatctgtgag aagcatattt 6720 tatggcttaaggattataag aatagcagta attggaagct tttcaaagaa tgttggaaac 6780 aaggacagcctgcagtggtt tctggtgtgc ataagaaaat gaacattagc ctatggaagg 6840 cggaatcaattagtcttgat tttggagacc accaagctga tctcctgaac tgcaaagata 6900 gcatcatttcaaatgccaat gttaaggaat tctgggatgg ttttgaagaa gtttcaaaac 6960 ggcagaaaaacaagagtgga gaaacagttg ttttaaaatt gaaagactgg ccttcaggag 7020 aagacttcaagactatgatg ccagcaagat acgaagatct tttaaaaagt ctgccattgc 7080 cagaatattgtaatccagaa ggaaaattca atttggcctc tcatttgcca ggattttttg 7140 tacgtcctgatctaggaccc aggttgtgca gtgcctatgg tgtagttgct gctaaagatc 7200 atgatataggaacaacaaat ctccatattg aagtttctga tgttgtaaat atactagttt 7260 atgttggcatagcaaaagga aatggcattc tctcaaaagc aggaattctc aagaaatttg 7320 aggaagaagatttggatgac attttaagga aaagattgaa ggactcaagt gaaatacctg 7380 gtgctctgtggcatatttat gctgggaaag atgttgacaa gataagggaa tttcttcaaa 7440 agatttcaaaagaacaaggc cttgaagttc taccagaaca tgatccaata cgtgaccaaa 7500 gttggtatgtgaacaaaaag ctccgtcaaa ggctgcttga agaatatgga gtcagaacct 7560 gtactcttattcagttcctt ggtgatgcta ttgttttgcc agcgggagca cttcatcagg 7620 ttcagaattttcacagctgt attcaggtaa ctgaagattt tgtgtctcca gaacatcttg 7680 tagagtcatttcatttaaca caggaactga gacttttgaa ggaagaaatc aattatgatg 7740 ataaactacaggttaaaaat attttgtatc atgcagtcaa agaaatggtg agagccttga 7800 agatacacgaggatgaagta gaggatatgg aagaaaatta agtgtgatcc agtttgatat 7860 ttttaggttgttgaactggg attacttaac cttgaatgat gatatgtatg cacactgact 7920 ttaagcttcataaaaccatc agtgccaaga aattctcttt gtagtaatta cttgttactg 7980 acaccacagcagtatagcat atgtcacagc tcctgtgatt caatgttata aaacaagcag 8040 aattttaaaagcagcactat atagctgttt tgtattatag tgtatatgat gtttgtgaaa 8100 atgccagatttaaaatgatg tatttatttt tggtaaaaaa taaaaaattc tatgctatat 8160 tgttgatcaagtgtaaatgt gaccttgtac agtttactaa aattactgat atttttcact 8220 acattgagacagttactgtg agaataggac acaaacacca gctattgcct gcatctggga 8280 aattgctgaatcgcacagca gtcatgtcat aatcagaaaa ttactgccaa ataattgtaa 8340 aatttgtaaagtataaagta tataaagtag atactaaata cagacacttc aatattttgt 8400 tgaagctattgactgtacaa ttaaacattt tcaaaaggtg taatttattt aaaattgtct 8460 cattttggtaaaatttatgt gaacttttaa agctaaatat taaacttaat atgctatgta 8520 aatatatacatatatacatt taatgatgta tttttttaaa acattggctt gcttttgtta 8580 aagtgcaagtgttacatatg gctttgtaca ttaaagttga aaggggtttt acattttcca 8640 ttaaaaggactttatc 8656 4 8622 DNA Homo sapiens 4 ttgcggggag ggcccgaggt cgctgtgtccggggcagagc ggccggttcg tcccgagtct 60 gcgctctttc gggtccgctg ctgtgtcccggtccgctctc ctcagcatga gcggccggta 120 ggagtgaggt ttcatcggtt ccctcgcactggaggaggca gcggccgctt cggcagcgac 180 agctatggcg gtagagacgc gggcagagctggtgggtaag cggttcctgt gtgtggcggt 240 cggcgacgag gcacgttcgg agcgctgggagagcggacgc ggctggcgaa gctggcgagc 300 gggggtcatc cgagccgtgt cacacagggacagccgcaat ccggacctgg cggtgtatgt 360 ggaatttgat gatcttgaat gggataaacgagagtgggtt aaagtttatg aagatttttc 420 aactttcttg gtggaatacc acttaatctgggccaaaagg aatgacccta gccagactca 480 gggatcaaag agcaaacaga ttcagtggcctgcattgact ttcaaacctc tggttgaaag 540 aaatataccc agttcagtca ctgcagtagaattccttgta gataagcaac tggatttttt 600 aactgaagat agtgcctttc agccctaccaggacgacata gacagcctaa acccagttct 660 cagggacaac ccgcagcttc atgaggaagtgaaagtctgg gtaaaggaac aaaaggttca 720 ggagattttt atgcaaggtc cttattccttaaatggatac agagtgagag tatatagaca 780 agactctgcc acccagtggt ttactggcataattactcat catgatctct tcacccgcac 840 catgatcgtt atgaatgatc aggtactagaaccacagaat gtcgatcctt ctatggttca 900 aatgaccttt ctagatgatg ttgttcactctttgttaaaa ggtgaaaata ttggcattac 960 atcacgacgc aggtctcgtg ccaatcaaaacgtcaacgct gttcacagcc attatacacg 1020 tgcccaagca aatagtccca gaccagcaatgaactcccaa gctgctgtac caaaacagaa 1080 tacacaccag caacagcaac aaagaagtatccgtccaaat aagaggaagg gctcagatag 1140 cagtatacca gatgaagaga agatgaaggaggaaaaatat gattatatat cacgaggaga 1200 aaatcctaaa ggtaaaaaca aacacttgatgaataaaaga aggaaacctg aggaggatga 1260 aaagaaacta aatatgaaaa gacttcgaactgacaatgtt tcagactttt ctgagagcag 1320 tgactcagaa aattcaaata agagaataatagataattcc tcagaacaga agccagagaa 1380 tgaattgaaa aataaaaata cttcaaaaataaatggagaa gaaggaaaac cccataataa 1440 tgagaaggca ggagaagaga ccctaaaaaatagccagcct ccctgggatc aaatacagga 1500 agataaaaaa catgaagaag cagagaagcggaagtctgtt gacactcagc ttcaagaaga 1560 tatgattatt cattcgtcag aacagtccacagtttctgat cataattcta atgatttact 1620 tcctcaggaa tgcaatatgg ataaaacacataccatggaa ttgctaccaa aggagaagtt 1680 tgtatccaga ccacccacac caaaatgtgttattgatatt acaaatgaca ctaatttaga 1740 aaaggtggct caggaaaact caagtacctttggccttcag acacttcaga aaatggatcc 1800 taatgttagt gattcaaaac actctattgcaaatgcaaaa ttcttggaaa cagcaaaaaa 1860 agattctgac cagagctggg tcagtgatgtagttaaagtg gatctaaccc aatcaagtgt 1920 tacaaatgct tcttcaggaa atgatcacttgaacatggaa aaagagaagt atgtctctta 1980 catttctcct ttaagtgcag tttctgtcatggaagataag ctgcataagc gaagtccacc 2040 tccagagact ataaaatcta aacttaatacttcagtagat actcacaaga taaaatccag 2100 cccatcacct gaagttgtta aacccaaaataactcattct cctgattctg taaagtctaa 2160 ggccacttat gtgaacagcc aagctactggtgaaagaaga ttggcaaata agatagaaca 2220 tgagctatca agatgcagtt ttcatccaattcctactcga agcagtacat tagaaactac 2280 aaagagtcct cttatcattg ataaaaatgagcattttaca gtttacagag atcctgcact 2340 tattgggtca gaaacaggag ctaatcatatttcacctttc ctaagccagc atccttttcc 2400 tcttcactcc tcatctcata gaacctgtttaaatccaggt acccatcatc ctgccttaac 2460 tcctgcaccc catttactag ccggatcatctagtcaaact ccattaccta ccattaacac 2520 tcatcctctg actagtggtc cacaccatgctgttcatcac cctcatttac ttcccactgt 2580 gttacctgga gtgcctactg cctccttacttggtggccac ccacgactag agagtgctca 2640 tgccagcagc ttgagccact tagcgctagcacaccagcaa caacaacagt tgttacagca 2700 ccagtcacct catcttcttg gacaagcccatccttctgct tcatataatc agcttggact 2760 ttatccaatt atttggcagt atccaaatggaacacatgca tactcaggac ttggtttgcc 2820 ttcttctaag tgggttcacc cagaaaatgcagttaatgct gaagcttcat taaggaggaa 2880 ttctcccagt ccttggctac atcagcccacccctgtgacc tcagcagatg gtattggatt 2940 acttagtcac attcctgtca gaccttccagtgcagagcct catcggcctc ttaaaattac 3000 agcccattcc agtccaccat tgacaaaaactttagtagat catcataagg aagaattaga 3060 aagaaaagct tttatggaac cattacggtctgttgcatcc acatcagcca aaaatgacct 3120 ggatctaaat aggtcacaga ctggaaaagattgtcactta cataggcatt ttgtggatcc 3180 agtattaaat cagttacaga ggccaccccaggagactgga gagaggttaa acaaatacaa 3240 agaggaacac cgtcgaattc ttcaagaaagtattgatgtt gctcccttta caactaaaat 3300 caagggactt gagggtgaga gagagaattattccagagtg gcatcatcat cttccagtcc 3360 taaaagccat atcatcaaac aagatatggatgtagaacgc tcagtatcag atctttataa 3420 aatgaagcac tcagtgcctc agagtttaccccaaagtaac tatttcacta cattgtctaa 3480 tagtgtggtc aatgaaccac caagatcatacccatccaaa gaagtttcaa atatttacgg 3540 tgataaacag agtaatgccc ttgcagcggcagcagctaat cctcaaactc tgacttcatt 3600 tataacatct ctttcaaagc ctccacctttgattaaacac caaccagaaa gtgaaggttt 3660 agtaggcaag ataccagaac atcttccacatcagattgca tctcactcag taacaacctt 3720 cagaaatgat tgtaggagtc ctacccatttgacagtttct tctacaaata cactccgcag 3780 tatgcctgca ttacatagag caccagtatttcacccacca atccatcaca gcctggaaag 3840 aaaggaaggc agctatagta gtctttcccctccaacttta actccggtga tgccagtaaa 3900 tgctggtggt aaagttcaag aatcacagaagcctccaact ctaatacccg aaccaaaaga 3960 ctcccaggca aattttaaga gttcttcagaacagagtttg acggagatgt ggagacctaa 4020 taataacctc agcaaagaga aaactgaatggcatgtggag aaaagcagcg gaaagttaca 4080 ggctgctatg gcatctgtca ttgtgcgtccatcttctagt acaaaaactg atagtatgcc 4140 agcaatgcag ttagcttcta aagatcgagttagtgaaaga tcttcagctg gggcacataa 4200 aacagattgc ctcaaactag cagaagccggagaaactgga agaatcattt tgccaaatgt 4260 gaattcagac agtgttcaca caaaatctgaaaaaaacttt caggctgtct cacagggcag 4320 tgttcccagt tcagtcatgt ctgctgtaaatacgatgtgt aataccaaaa cggatgtaat 4380 cacatctgct gccgatacta ccagtgtttccagctggggt ggttcagaag taatttcctc 4440 tttatcaaat accattttgg cctctacatcatcagaatgt gtatcttcaa aaagtgtcag 4500 tcagccagtg gctcaaaaac aagaatgcaaggtcagcacc acagcaccag ttacattagc 4560 cagtagtaag acaggaagtg ttgttcaacccagttctggg ttctcaggca caactgattt 4620 tatccattta aaaaagcaca aggcagcattggctgcagct cagtataaaa gtagtaatgc 4680 cagtgagact gaacctaatg ctataaaaaatcagacactt tcagcctccc ttcctctgga 4740 tagcactgta atctgtagta caattaacaaagcaaactct gtaggaaatg ggcaagcttc 4800 ccagacaagt caaccaaact accatactaaactgaaaaag gcctggctca ccagacactc 4860 agaagaagat aaaaatacta ataaaatggaaaattcaggg aattctgtat cagaaattat 4920 taagccatgt tctgtcaact taatagcctctacatctagt gatatacaaa atagtgtaga 4980 tagtaagatc atagttgata aatatgtaaaagatgataaa gtcaacagga gaaaagccaa 5040 aagaacttat gaatctggct ctgaaagtggagactcagat gaaagtgaaa gcaagtcaga 5100 gcaaaggact aaaagacaac ctaagccaacttacaaaaag aagcaaaatg atttgcaaaa 5160 gagaaaaggt gaaatagaag aagatttgaaacccaatgga gttctcagca ggagtgccaa 5220 agaaagaagt aaactgaagt tgcaaagcaacagtaatact ggcattcctc gttcagtatt 5280 gaaagattgg cgtaaagtca agaagctgaagcaaactggg gaatcctttt tacaggatga 5340 ctcctgctgt gagatagggc ctaatttacaaaagtgtcga gaatgtagac ttattcgcag 5400 taaaaaagga gaagaaccag ctcactcaccagtattttgt agattttact actttagacg 5460 gttgtcattt agtaaaaacg gagtagttagaatagatggt ttctcttctc ctgaccaata 5520 tgatgatgaa gctatgagtt tgtggacacatgaaaatttt gaagatgatg aactagatat 5580 agagacttct aaatatatct tggatataataggtgataag ttctgtcaat tagtaacatc 5640 tgaaaaaaca gctttgtcct gggtgaaaaaggatgccaaa attgcctgga aaagagcagt 5700 gagaggagtc cgggagatgt gtgatgcatgtgaagcaaca ttgtttaaca ttcactgggt 5760 ctgccaaaaa tgtggatttg tggtctgcttagattgttac aaggcaaagg aaaggaagag 5820 ttctagagat aaagaactat atgcttggatgaagtgtgtg aagggacagc ctcatgatca 5880 caaacattta atgccaaccc aaattatacctggttctgtt ttgacagatc ttctagatgc 5940 catgcacact cttagggaaa aatatggtattaaatcccat tgtcattgta ctaacaaaca 6000 gaatttacaa gttggaaatt ttcctacaatgaatggtgta tctcaagttt tacagaatgt 6060 tcttaatcac agtaataaaa tttctctgtgcatgcctgag tctcagcagc aaaatactcc 6120 tccgaagtct gagaaaaatg gtggcagcagcccagagagt gatgtaggca cagataacaa 6180 gttaactcct ccagaatccc agtcaccactgcactggtta gcagatcttg cagagcaaaa 6240 agccagagag gaaaaaaaag aaaacaaagaacttaccctt gaaaaccaaa ttaaagaaga 6300 aagagaacaa gacaactctg aatctccaaatggcagaaca tcacctcttg tgtcccagaa 6360 taatgaacaa ggctcaacct tacgggatttgctgactaca acagctggaa agctacgtgt 6420 ggggtctaca gatgctggca ttgcctttgccccagtatat tcaatgggag ccccaagtag 6480 caaaagtgga cggactatgc ctaacattcttgatgacata attgcttcag ttgttgaaaa 6540 caaaattcca ccaagtaaaa cctccaagataaatgtaaaa ccagagctta aagaagagcc 6600 tgaagaaagc ataatatctg cagtggatgaaaataataaa ttatacagtg atataccaca 6660 ttcttggatc tgtgagaagc atattttatggcttaaggat tataagaata gcagtaattg 6720 gaagcttttc aaagaatgtt ggaaacaaggacagcctgca gtggtttctg gtgtgcataa 6780 gaaaatgaac attagcctat ggaaggcggaatcaattagt cttgattttg gagaccacca 6840 agctgatctc ctgaactgca aagatagcatcatttcaaat gccaatgtta aggaattctg 6900 ggatggtttt gaagaagttt caaaacggcagaaaaacaag agtggagaaa cagttgtttt 6960 aaaattgaaa gactggcctt caggagaagacttcaagact atgatgccag caagatacga 7020 agatctttta aaaagtctgc cattgccagaatattgtaat ccagaaggaa aattcaattt 7080 ggcctctcat ttgccaggat tttttgtacgtcctgatcta ggacccaggt tgtgcagtgc 7140 ctatggtgta gttgctgcta aagatcatgatataggaaca acaaatctcc atattgaagt 7200 ttctgatgtt gtaaatatac tagtttatgttggcatagca aaaggaaatg gcattctctc 7260 aaaagcagga attctcaaga aatttgaggaagaagatttg gatgacattt taaggaaaag 7320 attgaaggac tcaagtgaaa tacctggtgctctgtggcat atttatgctg ggaaagatgt 7380 tgacaagata agggaatttc ttcaaaagatttcaaaagaa caaggccttg aagttctacc 7440 agaacatgat ccaatacgtg accaaagttggtatgtgaac aaaaagctcc gtcaaaggct 7500 gcttgaagaa tatggagtca gaacctgtactcttattcag ttccttggtg atgctattgt 7560 tttgccagcg ggagcacttc atcaggttcagaattttcac agctgtattc aggtaactga 7620 agattttgtg tctccagaac atcttgtagagtcatttcat ttaacacagg aactgagact 7680 tttgaaggaa gaaatcaatt atgatgataaactacaggtt aaaaatattt tgtatcatgc 7740 agtcaaagaa atggtgagag ccttgaagatacacgaggat gaagtagagg atatggaaga 7800 aaattaagtg tgatccagtt tgatatttttaggttgttga actgggatta cttaaccttg 7860 aatgatgata tgtatgcaca ctgactttaagcttcataaa accatcagtg ccaagaaatt 7920 ctctttgtag taattacttg ttactgacaccacagcagta tagcatatgt cacagctcct 7980 gtgattcaat gttataaaac aagcagaattttaaaagcag cactatatag ctgttttgta 8040 ttatagtgta tatgatgttt gtgaaaatgccagatttaaa atgatgtatt tatttttggt 8100 aaaaaataaa aaattctatg ctatattgttgatcaagtgt aaatgtgacc ttgtacagtt 8160 tactaaaatt actgatattt ttcactacattgagacagtt actgtgagaa taggacacaa 8220 acaccagcta ttgcctgcat ctgggaaattgctgaatcgc acagcagtca tgtcataatc 8280 agaaaattac tgccaaataa ttgtaaaatttgtaaagtat aaagtatata aagtagatac 8340 taaatacaga cacttcaata ttttgttgaagctattgact gtacaattaa acattttcaa 8400 aaggtgtaat ttatttaaaa ttgtctcattttggtaaaat ttatgtgaac ttttaaagct 8460 aaatattaaa cttaatatgc tatgtaaatatatacatata tacatttaat gatgtatttt 8520 tttaaaacat tggcttgctt ttgttaaagtgcaagtgtta catatggctt tgtacattaa 8580 agttgaaagg ggttttacat tttccattaaaaggacttta tc 8622 5 8585 DNA Homo sapiens 5 ttgcggggag ggcccgaggtcgctgtgtcc ggggcagagc ggccggttcg tcccgagtct 60 gcgctctttc gggtccgctgctgtgtcccg gtccgctctc ctcagcatga gcggccggag 120 gcagcggccg cttcggcagcgacagctatg gcggtagaga cgcgggcaga gctggtgggt 180 aagcggttcc tgtgtgtggcggtcggcgac gaggcacgtt cggagcgctg ggagagcgga 240 cgcggctggc gaagctggcgagcgggggtc atccgagccg tgtcacacag ggacagccgc 300 aatccggacc tggcggtgtatgtggaattt gatgatcttg aatgggataa acgagagtgg 360 gttaaagttt atgaagatttttcaactttc ttggtggaat accacttaat ctgggccaaa 420 aggaatgacc ctagccagactcagggatca aagagcaaac agattcagtg gcctgcattg 480 actttcaaac ctctggttgaaagaaatata cccagttcag tcactgcagt agaattcctt 540 gtagataagc aactggattttttaactgaa gatagtgcct ttcagcccta ccaggacgac 600 atagacagcc taaacccagttctcagggac aacccgcagc ttcatgagga agtgaaagtc 660 tgggtaaagg aacaaaaggttcaggagatt tttatgcaag gtccttattc cttaaatgga 720 tacagagtga gagtatatagacaagactct gccacccagt ggtttactgg cataattact 780 catcatgatc tcttcacccgcaccatgatc gttatgaatg atcaggtact agaaccacag 840 aatgtcgatc cttctatggttcaaatgacc tttctagatg atgttgttca ctctttgtta 900 aaaggtgaaa atattggcattacatcacga cgcaggtctc gtgccaatca aaacgtcaac 960 gctgttcaca gccattatacacgtgcccaa gcaaatagtc ccagaccagc aatgaactcc 1020 caagctgctg taccaaaacagaatacacac cagcaacagc aacaaagaag tatccgtcca 1080 aataagagga agggctcagatagcagtata ccagatgaag agaagatgaa ggaggaaaaa 1140 tatgattata tatcacgaggagaaaatcct aaaggtaaaa acaaacactt gatgaataaa 1200 agaaggaaac ctgaggaggatgaaaagaaa ctaaatatga aaagacttcg aactgacaat 1260 gtttcagact tttctgagagcagtgactca gaaaattcaa ataagagaat aatagataat 1320 tcctcagaac agaagccagagaatgaattg aaaaataaaa atacttcaaa aataaatgga 1380 gaagaaggaa aaccccataataatgagaag gcaggagaag agaccctaaa aaatagccag 1440 cctccctggg atcaaatacaggaagataaa aaacatgaag aagcagagaa gcggaagtct 1500 gttgacactc agcttcaagaagatatgatt attcattcgt cagaacagtc cacagtttct 1560 gatcataatt ctaatgatttacttcctcag gaatgcaata tggataaaac acataccatg 1620 gaattgctac caaaggagaagtttgtatcc agaccaccca caccaaaatg tgttattgat 1680 attacaaatg acactaatttagaaaaggtg gctcaggaaa actcaagtac ctttggcctt 1740 cagacacttc agaaaatggatcctaatgtt agtgattcaa aacactctat tgcaaatgca 1800 aaattcttgg aaacagcaaaaaaagattct gaccagagct gggtcagtga tgtagttaaa 1860 gtggatctaa cccaatcaagtgttacaaat gcttcttcag gaaatgatca cttgaacatg 1920 gaaaaagaga agtatgtctcttacatttct cctttaagtg cagtttctgt catggaagat 1980 aagctgcata agcgaagtccacctccagag actataaaat ctaaacttaa tacttcagta 2040 gatactcaca agataaaatccagcccatca cctgaagttg ttaaacccaa aataactcat 2100 tctcctgatt ctgtaaagtctaaggccact tatgtgaaca gccaagctac tggtgaaaga 2160 agattggcaa ataagatagaacatgagcta tcaagatgca gttttcatcc aattcctact 2220 cgaagcagta cattagaaactacaaagagt cctcttatca ttgataaaaa tgagcatttt 2280 acagtttaca gagatcctgcacttattggg tcagaaacag gagctaatca tatttcacct 2340 ttcctaagcc agcatccttttcctcttcac tcctcatctc atagaacctg tttaaatcca 2400 ggtacccatc atcctgccttaactcctgca ccccatttac tagccggatc atctagtcaa 2460 actccattac ctaccattaacactcatcct ctgactagtg gtccacacca tgctgttcat 2520 caccctcatt tacttcccactgtgttacct ggagtgccta ctgcctcctt acttggtggc 2580 cacccacgac tagagagtgctcatgccagc agcttgagcc acttagcgct agcacaccag 2640 caacaacaac agttgttacagcaccagtca cctcatcttc ttggacaagc ccatccttct 2700 gcttcatata atcagcttggactttatcca attatttggc agtatccaaa tggaacacat 2760 gcatactcag gacttggtttgccttcttct aagtgggttc acccagaaaa tgcagttaat 2820 gctgaagctt cattaaggaggaattctccc agtccttggc tacatcagcc cacccctgtg 2880 acctcagcag atggtattggattacttagt cacattcctg tcagaccttc cagtgcagag 2940 cctcatcggc ctcttaaaattacagcccat tccagtccac cattgacaaa aactttagta 3000 gatcatcata aggaagaattagaaagaaaa gcttttatgg aaccattacg gtctgttgca 3060 tccacatcag ccaaaaatgacctggatcta aataggtcac agactggaaa agattgtcac 3120 ttacataggc attttgtggatccagtatta aatcagttac agaggccacc ccaggagact 3180 ggagagaggt taaacaaatacaaagaggaa caccgtcgaa ttcttcaaga aagtattgat 3240 gttgctccct ttacaactaaaatcaaggga cttgagggtg agagagagaa ttattccaga 3300 gtggcatcat catcttccagtcctaaaagc catatcatca aacaagatat ggatgtagaa 3360 cgctcagtat cagatctttataaaatgaag cactcagtgc ctcagagttt accccaaagt 3420 aactatttca ctacattgtctaatagtgtg gtcaatgaac caccaagatc atacccatcc 3480 aaagaagttt caaatatttacggtgataaa cagagtaatg cccttgcagc ggcagcagct 3540 aatcctcaaa ctctgacttcatttataaca tctctttcaa agcctccacc tttgattaaa 3600 caccaaccag aaagtgaaggtttagtaggc aagataccag aacatcttcc acatcagatt 3660 gcatctcact cagtaacaaccttcagaaat gattgtagga gtcctaccca tttgacagtt 3720 tcttctacaa atacactccgcagtatgcct gcattacata gagcaccagt atttcaccca 3780 ccaatccatc acagcctggaaagaaaggaa ggcagctata gtagtctttc ccctccaact 3840 ttaactccgg tgatgccagtaaatgctggt ggtaaagttc aagaatcaca gaagcctcca 3900 actctaatac ccgaaccaaaagactcccag gcaaatttta agagttcttc agaacagagt 3960 ttgacggaga tgtggagacctaataataac ctcagcaaag agaaaactga atggcatgtg 4020 gagaaaagca gcggaaagttacaggctgct atggcatctg tcattgtgcg tccatcttct 4080 agtacaaaaa ctgatagtatgccagcaatg cagttagctt ctaaagatcg agttagtgaa 4140 agatcttcag ctggggcacataaaacagat tgcctcaaac tagcagaagc cggagaaact 4200 ggaagaatca ttttgccaaatgtgaattca gacagtgttc acacaaaatc tgaaaaaaac 4260 tttcaggctg tctcacagggcagtgttccc agttcagtca tgtctgctgt aaatacgatg 4320 tgtaatacca aaacggatgtaatcacatct gctgccgata ctaccagtgt ttccagctgg 4380 ggtggttcag aagtaatttcctctttatca aataccattt tggcctctac atcatcagaa 4440 tgtgtatctt caaaaagtgtcagtcagcca gtggctcaaa aacaagaatg caaggtcagc 4500 accacagcac cagttacattagccagtagt aagacaggaa gtgttgttca acccagttct 4560 gggttctcag gcacaactgattttatccat ttaaaaaagc acaaggcagc attggctgca 4620 gctcagtata aaagtagtaatgccagtgag actgaaccta atgctataaa aaatcagaca 4680 ctttcagcct cccttcctctggatagcact gtaatctgta gtacaattaa caaagcaaac 4740 tctgtaggaa atgggcaagcttcccagaca agtcaaccaa actaccatac taaactgaaa 4800 aaggcctggc tcaccagacactcagaagaa gataaaaata ctaataaaat ggaaaattca 4860 gggaattctg tatcagaaattattaagcca tgttctgtca acttaatagc ctctacatct 4920 agtgatatac aaaatagtgtagatagtaag atcatagttg ataaatatgt aaaagatgat 4980 aaagtcaaca ggagaaaagccaaaagaact tatgaatctg gctctgaaag tggagactca 5040 gatgaaagtg aaagcaagtcagagcaaagg actaaaagac aacctaagcc aacttacaaa 5100 aagaagcaaa atgatttgcaaaagagaaaa ggtgaaatag aagaagattt gaaacccaat 5160 ggagttctca gcaggagtgccaaagaaaga agtaaactga agttgcaaag caacagtaat 5220 actggcattc ctcgttcagtattgaaagat tggcgtaaag tcaagaagct gaagcaaact 5280 ggggaatcct ttttacaggatgactcctgc tgtgagatag ggcctaattt acaaaagtgt 5340 cgagaatgta gacttattcgcagtaaaaaa ggagaagaac cagctcactc accagtattt 5400 tgtagatttt actactttagacggttgtca tttagtaaaa acggagtagt tagaatagat 5460 ggtttctctt ctcctgaccaatatgatgat gaagctatga gtttgtggac acatgaaaat 5520 tttgaagatg atgaactagatatagagact tctaaatata tcttggatat aataggtgat 5580 aagttctgtc aattagtaacatctgaaaaa acagctttgt cctgggtgaa aaaggatgcc 5640 aaaattgcct ggaaaagagcagtgagagga gtccgggaga tgtgtgatgc atgtgaagca 5700 acattgttta acattcactgggtctgccaa aaatgtggat ttgtggtctg cttagattgt 5760 tacaaggcaa aggaaaggaagagttctaga gataaagaac tatatgcttg gatgaagtgt 5820 gtgaagggac agcctcatgatcacaaacat ttaatgccaa cccaaattat acctggttct 5880 gttttgacag atcttctagatgccatgcac actcttaggg aaaaatatgg tattaaatcc 5940 cattgtcatt gtactaacaaacagaattta caagttggaa attttcctac aatgaatggt 6000 gtatctcaag ttttacagaatgttcttaat cacagtaata aaatttctct gtgcatgcct 6060 gagtctcagc agcaaaatactcctccgaag tctgagaaaa atggtggcag cagcccagag 6120 agtgatgtag gcacagataacaagttaact cctccagaat cccagtcacc actgcactgg 6180 ttagcagatc ttgcagagcaaaaagccaga gaggaaaaaa aagaaaacaa agaacttacc 6240 cttgaaaacc aaattaaagaagaaagagaa caagacaact ctgaatctcc aaatggcaga 6300 acatcacctc ttgtgtcccagaataatgaa caaggctcaa ccttacggga tttgctgact 6360 acaacagctg gaaagctacgtgtggggtct acagatgctg gcattgcctt tgccccagta 6420 tattcaatgg gagccccaagtagcaaaagt ggacggacta tgcctaacat tcttgatgac 6480 ataattgctt cagttgttgaaaacaaaatt ccaccaagta aaacctccaa gataaatgta 6540 aaaccagagc ttaaagaagagcctgaagaa agcataatat ctgcagtgga tgaaaataat 6600 aaattataca gtgatataccacattcttgg atctgtgaga agcatatttt atggcttaag 6660 gattataaga atagcagtaattggaagctt ttcaaagaat gttggaaaca aggacagcct 6720 gcagtggttt ctggtgtgcataagaaaatg aacattagcc tatggaaggc ggaatcaatt 6780 agtcttgatt ttggagaccaccaagctgat ctcctgaact gcaaagatag catcatttca 6840 aatgccaatg ttaaggaattctgggatggt tttgaagaag tttcaaaacg gcagaaaaac 6900 aagagtggag aaacagttgttttaaaattg aaagactggc cttcaggaga agacttcaag 6960 actatgatgc cagcaagatacgaagatctt ttaaaaagtc tgccattgcc agaatattgt 7020 aatccagaag gaaaattcaatttggcctct catttgccag gattttttgt acgtcctgat 7080 ctaggaccca ggttgtgcagtgcctatggt gtagttgctg ctaaagatca tgatatagga 7140 acaacaaatc tccatattgaagtttctgat gttgtaaata tactagttta tgttggcata 7200 gcaaaaggaa atggcattctctcaaaagca ggaattctca agaaatttga ggaagaagat 7260 ttggatgaca ttttaaggaaaagattgaag gactcaagtg aaatacctgg tgctctgtgg 7320 catatttatg ctgggaaagatgttgacaag ataagggaat ttcttcaaaa gatttcaaaa 7380 gaacaaggcc ttgaagttctaccagaacat gatccaatac gtgaccaaag ttggtatgtg 7440 aacaaaaagc tccgtcaaaggctgcttgaa gaatatggag tcagaacctg tactcttatt 7500 cagttccttg gtgatgctattgttttgcca gcgggagcac ttcatcaggt tcagaatttt 7560 cacagctgta ttcaggtaactgaagatttt gtgtctccag aacatcttgt agagtcattt 7620 catttaacac aggaactgagacttttgaag gaagaaatca attatgatga taaactacag 7680 gttaaaaata ttttgtatcatgcagtcaaa gaaatggtga gagccttgaa gatacacgag 7740 gatgaagtag aggatatggaagaaaattaa gtgtgatcca gtttgatatt tttaggttgt 7800 tgaactggga ttacttaaccttgaatgatg atatgtatgc acactgactt taagcttcat 7860 aaaaccatca gtgccaagaaattctctttg tagtaattac ttgttactga caccacagca 7920 gtatagcata tgtcacagctcctgtgattc aatgttataa aacaagcaga attttaaaag 7980 cagcactata tagctgttttgtattatagt gtatatgatg tttgtgaaaa tgccagattt 8040 aaaatgatgt atttatttttggtaaaaaat aaaaaattct atgctatatt gttgatcaag 8100 tgtaaatgtg accttgtacagtttactaaa attactgata tttttcacta cattgagaca 8160 gttactgtga gaataggacacaaacaccag ctattgcctg catctgggaa attgctgaat 8220 cgcacagcag tcatgtcataatcagaaaat tactgccaaa taattgtaaa atttgtaaag 8280 tataaagtat ataaagtagatactaaatac agacacttca atattttgtt gaagctattg 8340 actgtacaat taaacattttcaaaaggtgt aatttattta aaattgtctc attttggtaa 8400 aatttatgtg aacttttaaagctaaatatt aaacttaata tgctatgtaa atatatacat 8460 atatacattt aatgatgtatttttttaaaa cattggcttg cttttgttaa agtgcaagtg 8520 ttacatatgg ctttgtacattaaagttgaa aggggtttta cattttccat taaaaggact 8580 ttatc 8585 6 8552 DNAHomo sapiens 6 ttgcggggag ggcccgaggt cgctgtgtcc ggggcagagc ggccggttcgtcccgagtct 60 gcgctctttc gggtccgctg ctgtgtcccg gtccgctctc ctcagcatgagcggccggta 120 ggagtgaggt ttcatcggtt ccctcgcact ggaggaggca gcggccgcttcggcagcgac 180 agctatggcg gtagagacgc gggcagagct ggtgggtaag cggttcctgtgtgtggcggt 240 cggcgacgag gcacgttcgg agcgctggga gagcggacgc ggctggcgaagctggcgagc 300 gggggtcatc cgagccgtgt cacacaggga cagccgcaat ccggacctggcgatgattca 360 aataagtgga aatggaaact cttcaggtgt gtatgtggaa tttgatgatcttgaatggga 420 taaacgagag tgggttaaag tttatgaaga tttttcaact ttcttggtggaataccactt 480 aatctgggcc aaaaggaatg accctagcca gactcaggga tcaaagagcaaacagattca 540 gtggcctgca ttgactttca aacctctggt tgaaagaaat atacccagttcagtcactgc 600 agtagaattc cttgtagata agcaactgga ttttttaact gaagatagtgcctttcagcc 660 ctaccaggtc cttattcctt aaatggatac agagtgagag tatatagacaagactctgcc 720 acccagtggt ttactggcat aattactcat catgatctct tcacccgcaccatgatcgtt 780 atgaatgatc aggtactaga accacagaat gtcgatcctt ctatggttcaaatgaccttt 840 ctagatgatg ttgttcactc tttgttaaaa ggtgaaaata ttggcattacatcacgacgc 900 aggtctcgtg ccaatcaaaa cgtcaacgct gttcacagcc attatacacgtgcccaagca 960 aatagtccca gaccagcaat gaactcccaa gctgctgtac caaaacagaatacacaccag 1020 caacagcaac aaagaagtat ccgtccaaat aagaggaagg gctcagatagcagtatacca 1080 gatgaagaga agatgaagga ggaaaaatat gattatatat cacgaggagaaaatcctaaa 1140 ggtaaaaaca aacacttgat gaataaaaga aggaaacctg aggaggatgaaaagaaacta 1200 aatatgaaaa gacttcgaac tgacaatgtt tcagactttt ctgagagcagtgactcagaa 1260 aattcaaata agagaataat agataattcc tcagaacaga agccagagaatgaattgaaa 1320 aataaaaata cttcaaaaat aaatggagaa gaaggaaaac cccataataatgagaaggca 1380 ggagaagaga ccctaaaaaa tagccagcct ccctgggatc aaatacaggaagataaaaaa 1440 catgaagaag cagagaagcg gaagtctgtt gacactcagc ttcaagaagatatgattatt 1500 cattcgtcag aacagtccac agtttctgat cataattcta atgatttacttcctcaggaa 1560 tgcaatatgg ataaaacaca taccatggaa ttgctaccaa aggagaagtttgtatccaga 1620 ccacccacac caaaatgtgt tattgatatt acaaatgaca ctaatttagaaaaggtggct 1680 caggaaaact caagtacctt tggccttcag acacttcaga aaatggatcctaatgttagt 1740 gattcaaaac actctattgc aaatgcaaaa ttcttggaaa cagcaaaaaaagattctgac 1800 cagagctggg tcagtgatgt agttaaagtg gatctaaccc aatcaagtgttacaaatgct 1860 tcttcaggaa atgatcactt gaacatggaa aaagagaagt atgtctcttacatttctcct 1920 ttaagtgcag tttctgtcat ggaagataag ctgcataagc gaagtccacctccagagact 1980 ataaaatcta aacttaatac ttcagtagat actcacaaga taaaatccagcccatcacct 2040 gaagttgtta aacccaaaat aactcattct cctgattctg taaagtctaaggccacttat 2100 gtgaacagcc aagctactgg tgaaagaaga ttggcaaata agatagaacatgagctatca 2160 agatgcagtt ttcatccaat tcctactcga agcagtacat tagaaactacaaagagtcct 2220 cttatcattg ataaaaatga gcattttaca gtttacagag atcctgcacttattgggtca 2280 gaaacaggag ctaatcatat ttcacctttc ctaagccagc atccttttcctcttcactcc 2340 tcatctcata gaacctgttt aaatccaggt acccatcatc ctgccttaactcctgcaccc 2400 catttactag ccggatcatc tagtcaaact ccattaccta ccattaacactcatcctctg 2460 actagtggtc cacaccatgc tgttcatcac cctcatttac ttcccactgtgttacctgga 2520 gtgcctactg cctccttact tggtggccac ccacgactag agagtgctcatgccagcagc 2580 ttgagccact tagcgctagc acaccagcaa caacaacagt tgttacagcaccagtcacct 2640 catcttcttg gacaagccca tccttctgct tcatataatc agcttggactttatccaatt 2700 atttggcagt atccaaatgg aacacatgca tactcaggac ttggtttgccttcttctaag 2760 tgggttcacc cagaaaatgc agttaatgct gaagcttcat taaggaggaattctcccagt 2820 ccttggctac atcagcccac ccctgtgacc tcagcagatg gtattggattacttagtcac 2880 attcctgtca gaccttccag tgcagagcct catcggcctc ttaaaattacagcccattcc 2940 agtccaccat tgacaaaaac tttagtagat catcataagg aagaattagaaagaaaagct 3000 tttatggaac cattacggtc tgttgcatcc acatcagcca aaaatgacctggatctaaat 3060 aggtcacaga ctggaaaaga ttgtcactta cataggcatt ttgtggatccagtattaaat 3120 cagttacaga ggccacccca ggagactgga gagaggttaa acaaatacaaagaggaacac 3180 cgtcgaattc ttcaagaaag tattgatgtt gctcccttta caactaaaatcaagggactt 3240 gagggtgaga gagagaatta ttccagagtg gcatcatcat cttccagtcctaaaagccat 3300 atcatcaaac aagatatgga tgtagaacgc tcagtatcag atctttataaaatgaagcac 3360 tcagtgcctc agagtttacc ccaaagtaac tatttcacta cattgtctaatagtgtggtc 3420 aatgaaccac caagatcata cccatccaaa gaagtttcaa atatttacggtgataaacag 3480 agtaatgccc ttgcagcggc agcagctaat cctcaaactc tgacttcatttataacatct 3540 ctttcaaagc ctccaccttt gattaaacac caaccagaaa gtgaaggtttagtaggcaag 3600 ataccagaac atcttccaca tcagattgca tctcactcag taacaaccttcagaaatgat 3660 tgtaggagtc ctacccattt gacagtttct tctacaaata cactccgcagtatgcctgca 3720 ttacatagag caccagtatt tcacccacca atccatcaca gcctggaaagaaaggaaggc 3780 agctatagta gtctttcccc tccaacttta actccggtga tgccagtaaatgctggtggt 3840 aaagttcaag aatcacagaa gcctccaact ctaatacccg aaccaaaagactcccaggca 3900 aattttaaga gttcttcaga acagagtttg acggagatgt ggagacctaataataacctc 3960 agcaaagaga aaactgaatg gcatgtggag aaaagcagcg gaaagttacaggctgctatg 4020 gcatctgtca ttgtgcgtcc atcttctagt acaaaaactg atagtatgccagcaatgcag 4080 ttagcttcta aagatcgagt tagtgaaaga tcttcagctg gggcacataaaacagattgc 4140 ctcaaactag cagaagccgg agaaactgga agaatcattt tgccaaatgtgaattcagac 4200 agtgttcaca caaaatctga aaaaaacttt caggctgtct cacagggcagtgttcccagt 4260 tcagtcatgt ctgctgtaaa tacgatgtgt aataccaaaa cggatgtaatcacatctgct 4320 gccgatacta ccagtgtttc cagctggggt ggttcagaag taatttcctctttatcaaat 4380 accattttgg cctctacatc atcagaatgt gtatcttcaa aaagtgtcagtcagccagtg 4440 gctcaaaaac aagaatgcaa ggtcagcacc acagcaccag ttacattagccagtagtaag 4500 acaggaagtg ttgttcaacc cagttctggg ttctcaggca caactgattttatccattta 4560 aaaaagcaca aggcagcatt ggctgcagct cagtataaaa gtagtaatgccagtgagact 4620 gaacctaatg ctataaaaaa tcagacactt tcagcctccc ttcctctggatagcactgta 4680 atctgtagta caattaacaa agcaaactct gtaggaaatg ggcaagcttcccagacaagt 4740 caaccaaact accatactaa actgaaaaag gcctggctca ccagacactcagaagaagat 4800 aaaaatacta ataaaatgga aaattcaggg aattctgtat cagaaattattaagccatgt 4860 tctgtcaact taatagcctc tacatctagt gatatacaaa atagtgtagatagtaagatc 4920 atagttgata aatatgtaaa agatgataaa gtcaacagga gaaaagccaaaagaacttat 4980 gaatctggct ctgaaagtgg agactcagat gaaagtgaaa gcaagtcagagcaaaggact 5040 aaaagacaac ctaagccaac ttacaaaaag aagcaaaatg atttgcaaaagagaaaaggt 5100 gaaatagaag aagatttgaa acccaatgga gttctcagca ggagtgccaaagaaagaagt 5160 aaactgaagt tgcaaagcaa cagtaatact ggcattcctc gttcagtattgaaagattgg 5220 cgtaaagtca agaagctgaa gcaaactggg gaatcctttt tacaggatgactcctgctgt 5280 gagatagggc ctaatttaca aaagtgtcga gaatgtagac ttattcgcagtaaaaaagga 5340 gaagaaccag ctcactcacc agtattttgt agattttact actttagacggttgtcattt 5400 agtaaaaacg gagtagttag aatagatggt ttctcttctc ctgaccaatatgatgatgaa 5460 gctatgagtt tgtggacaca tgaaaatttt gaagatgatg aactagatatagagacttct 5520 aaatatatct tggatataat aggtgataag ttctgtcaat tagtaacatctgaaaaaaca 5580 gctttgtcct gggtgaaaaa ggatgccaaa attgcctgga aaagagcagtgagaggagtc 5640 cgggagatgt gtgatgcatg tgaagcaaca ttgtttaaca ttcactgggtctgccaaaaa 5700 tgtggatttg tggtctgctt agattgttac aaggcaaagg aaaggaagagttctagagat 5760 aaagaactat atgcttggat gaagtgtgtg aagggacagc ctcatgatcacaaacattta 5820 atgccaaccc aaattatacc tggttctgtt ttgacagatc ttctagatgccatgcacact 5880 cttagggaaa aatatggtat taaatcccat tgtcattgta ctaacaaacagaatttacaa 5940 gttggaaatt ttcctacaat gaatggtgta tctcaagttt tacagaatgttcttaatcac 6000 agtaataaaa tttctctgtg catgcctgag tctcagcagc aaaatactcctccgaagtct 6060 gagaaaaatg gtggcagcag cccagagagt gatgtaggca cagataacaagttaactcct 6120 ccagaatccc agtcaccact gcactggtta gcagatcttg cagagcaaaaagccagagag 6180 gaaaaaaaag aaaacaaaga acttaccctt gaaaaccaaa ttaaagaagaaagagaacaa 6240 gacaactctg aatctccaaa tggcagaaca tcacctcttg tgtcccagaataatgaacaa 6300 ggctcaacct tacgggattt gctgactaca acagctggaa agctacgtgtggggtctaca 6360 gatgctggca ttgcctttgc cccagtatat tcaatgggag ccccaagtagcaaaagtgga 6420 cggactatgc ctaacattct tgatgacata attgcttcag ttgttgaaaacaaaattcca 6480 ccaagtaaaa cctccaagat aaatgtaaaa ccagagctta aagaagagcctgaagaaagc 6540 ataatatctg cagtggatga aaataataaa ttatacagtg atataccacattcttggatc 6600 tgtgagaagc atattttatg gcttaaggat tataagaata gcagtaattggaagcttttc 6660 aaagaatgtt ggaaacaagg acagcctgca gtggtttctg gtgtgcataagaaaatgaac 6720 attagcctat ggaaggcgga atcaattagt cttgattttg gagaccaccaagctgatctc 6780 ctgaactgca aagatagcat catttcaaat gccaatgtta aggaattctgggatggtttt 6840 gaagaagttt caaaacggca gaaaaacaag agtggagaaa cagttgttttaaaattgaaa 6900 gactggcctt caggagaaga cttcaagact atgatgccag caagatacgaagatctttta 6960 aaaagtctgc cattgccaga atattgtaat ccagaaggaa aattcaatttggcctctcat 7020 ttgccaggat tttttgtacg tcctgatcta ggacccaggt tgtgcagtgcctatggtgta 7080 gttgctgcta aagatcatga tataggaaca acaaatctcc atattgaagtttctgatgtt 7140 gtaaatatac tagtttatgt tggcatagca aaaggaaatg gcattctctcaaaagcagga 7200 attctcaaga aatttgagga agaagatttg gatgacattt taaggaaaagattgaaggac 7260 tcaagtgaaa tacctggtgc tctgtggcat atttatgctg ggaaagatgttgacaagata 7320 agggaatttc ttcaaaagat ttcaaaagaa caaggccttg aagttctaccagaacatgat 7380 ccaatacgtg accaaagttg gtatgtgaac aaaaagctcc gtcaaaggctgcttgaagaa 7440 tatggagtca gaacctgtac tcttattcag ttccttggtg atgctattgttttgccagcg 7500 ggagcacttc atcaggttca gaattttcac agctgtattc aggtaactgaagattttgtg 7560 tctccagaac atcttgtaga gtcatttcat ttaacacagg aactgagacttttgaaggaa 7620 gaaatcaatt atgatgataa actacaggtt aaaaatattt tgtatcatgcagtcaaagaa 7680 atggtgagag ccttgaagat acacgaggat gaagtagagg atatggaagaaaattaagtg 7740 tgatccagtt tgatattttt aggttgttga actgggatta cttaaccttgaatgatgata 7800 tgtatgcaca ctgactttaa gcttcataaa accatcagtg ccaagaaattctctttgtag 7860 taattacttg ttactgacac cacagcagta tagcatatgt cacagctcctgtgattcaat 7920 gttataaaac aagcagaatt ttaaaagcag cactatatag ctgttttgtattatagtgta 7980 tatgatgttt gtgaaaatgc cagatttaaa atgatgtatt tatttttggtaaaaaataaa 8040 aaattctatg ctatattgtt gatcaagtgt aaatgtgacc ttgtacagtttactaaaatt 8100 actgatattt ttcactacat tgagacagtt actgtgagaa taggacacaaacaccagcta 8160 ttgcctgcat ctgggaaatt gctgaatcgc acagcagtca tgtcataatcagaaaattac 8220 tgccaaataa ttgtaaaatt tgtaaagtat aaagtatata aagtagatactaaatacaga 8280 cacttcaata ttttgttgaa gctattgact gtacaattaa acattttcaaaaggtgtaat 8340 ttatttaaaa ttgtctcatt ttggtaaaat ttatgtgaac ttttaaagctaaatattaaa 8400 cttaatatgc tatgtaaata tatacatata tacatttaat gatgtatttttttaaaacat 8460 tggcttgctt ttgttaaagt gcaagtgtta catatggctt tgtacattaaagttgaaagg 8520 ggttttacat tttccattaa aaggacttta tc 8552 7 8515 DNA Homosapiens 7 ttgcggggag ggcccgaggt cgctgtgtcc ggggcagagc ggccggttcgtcccgagtct 60 gcgctctttc gggtccgctg ctgtgtcccg gtccgctctc ctcagcatgagcggccggag 120 gcagcggccg cttcggcagc gacagctatg gcggtagaga cgcgggcagagctggtgggt 180 aagcggttcc tgtgtgtggc ggtcggcgac gaggcacgtt cggagcgctgggagagcgga 240 cgcggctggc gaagctggcg agcgggggtc atccgagccg tgtcacacagggacagccgc 300 aatccggacc tggcgatgat tcaaataagt ggaaatggaa actcttcaggtgtgtatgtg 360 gaatttgatg atcttgaatg ggataaacga gagtgggtta aagtttatgaagatttttca 420 actttcttgg tggaatacca cttaatctgg gccaaaagga atgaccctagccagactcag 480 ggatcaaaga gcaaacagat tcagtggcct gcattgactt tcaaacctctggttgaaaga 540 aatataccca gttcagtcac tgcagtagaa ttccttgtag ataagcaactggatttttta 600 actgaagata gtgcctttca gccctaccag gtccttattc cttaaatggatacagagtga 660 gagtatatag acaagactct gccacccagt ggtttactgg cataattactcatcatgatc 720 tcttcacccg caccatgatc gttatgaatg atcaggtact agaaccacagaatgtcgatc 780 cttctatggt tcaaatgacc tttctagatg atgttgttca ctctttgttaaaaggtgaaa 840 atattggcat tacatcacga cgcaggtctc gtgccaatca aaacgtcaacgctgttcaca 900 gccattatac acgtgcccaa gcaaatagtc ccagaccagc aatgaactcccaagctgctg 960 taccaaaaca gaatacacac cagcaacagc aacaaagaag tatccgtccaaataagagga 1020 agggctcaga tagcagtata ccagatgaag agaagatgaa ggaggaaaaatatgattata 1080 tatcacgagg agaaaatcct aaaggtaaaa acaaacactt gatgaataaaagaaggaaac 1140 ctgaggagga tgaaaagaaa ctaaatatga aaagacttcg aactgacaatgtttcagact 1200 tttctgagag cagtgactca gaaaattcaa ataagagaat aatagataattcctcagaac 1260 agaagccaga gaatgaattg aaaaataaaa atacttcaaa aataaatggagaagaaggaa 1320 aaccccataa taatgagaag gcaggagaag agaccctaaa aaatagccagcctccctggg 1380 atcaaataca ggaagataaa aaacatgaag aagcagagaa gcggaagtctgttgacactc 1440 agcttcaaga agatatgatt attcattcgt cagaacagtc cacagtttctgatcataatt 1500 ctaatgattt acttcctcag gaatgcaata tggataaaac acataccatggaattgctac 1560 caaaggagaa gtttgtatcc agaccaccca caccaaaatg tgttattgatattacaaatg 1620 acactaattt agaaaaggtg gctcaggaaa actcaagtac ctttggccttcagacacttc 1680 agaaaatgga tcctaatgtt agtgattcaa aacactctat tgcaaatgcaaaattcttgg 1740 aaacagcaaa aaaagattct gaccagagct gggtcagtga tgtagttaaagtggatctaa 1800 cccaatcaag tgttacaaat gcttcttcag gaaatgatca cttgaacatggaaaaagaga 1860 agtatgtctc ttacatttct cctttaagtg cagtttctgt catggaagataagctgcata 1920 agcgaagtcc acctccagag actataaaat ctaaacttaa tacttcagtagatactcaca 1980 agataaaatc cagcccatca cctgaagttg ttaaacccaa aataactcattctcctgatt 2040 ctgtaaagtc taaggccact tatgtgaaca gccaagctac tggtgaaagaagattggcaa 2100 ataagataga acatgagcta tcaagatgca gttttcatcc aattcctactcgaagcagta 2160 cattagaaac tacaaagagt cctcttatca ttgataaaaa tgagcattttacagtttaca 2220 gagatcctgc acttattggg tcagaaacag gagctaatca tatttcacctttcctaagcc 2280 agcatccttt tcctcttcac tcctcatctc atagaacctg tttaaatccaggtacccatc 2340 atcctgcctt aactcctgca ccccatttac tagccggatc atctagtcaaactccattac 2400 ctaccattaa cactcatcct ctgactagtg gtccacacca tgctgttcatcaccctcatt 2460 tacttcccac tgtgttacct ggagtgccta ctgcctcctt acttggtggccacccacgac 2520 tagagagtgc tcatgccagc agcttgagcc acttagcgct agcacaccagcaacaacaac 2580 agttgttaca gcaccagtca cctcatcttc ttggacaagc ccatccttctgcttcatata 2640 atcagcttgg actttatcca attatttggc agtatccaaa tggaacacatgcatactcag 2700 gacttggttt gccttcttct aagtgggttc acccagaaaa tgcagttaatgctgaagctt 2760 cattaaggag gaattctccc agtccttggc tacatcagcc cacccctgtgacctcagcag 2820 atggtattgg attacttagt cacattcctg tcagaccttc cagtgcagagcctcatcggc 2880 ctcttaaaat tacagcccat tccagtccac cattgacaaa aactttagtagatcatcata 2940 aggaagaatt agaaagaaaa gcttttatgg aaccattacg gtctgttgcatccacatcag 3000 ccaaaaatga cctggatcta aataggtcac agactggaaa agattgtcacttacataggc 3060 attttgtgga tccagtatta aatcagttac agaggccacc ccaggagactggagagaggt 3120 taaacaaata caaagaggaa caccgtcgaa ttcttcaaga aagtattgatgttgctccct 3180 ttacaactaa aatcaaggga cttgagggtg agagagagaa ttattccagagtggcatcat 3240 catcttccag tcctaaaagc catatcatca aacaagatat ggatgtagaacgctcagtat 3300 cagatcttta taaaatgaag cactcagtgc ctcagagttt accccaaagtaactatttca 3360 ctacattgtc taatagtgtg gtcaatgaac caccaagatc atacccatccaaagaagttt 3420 caaatattta cggtgataaa cagagtaatg cccttgcagc ggcagcagctaatcctcaaa 3480 ctctgacttc atttataaca tctctttcaa agcctccacc tttgattaaacaccaaccag 3540 aaagtgaagg tttagtaggc aagataccag aacatcttcc acatcagattgcatctcact 3600 cagtaacaac cttcagaaat gattgtagga gtcctaccca tttgacagtttcttctacaa 3660 atacactccg cagtatgcct gcattacata gagcaccagt atttcacccaccaatccatc 3720 acagcctgga aagaaaggaa ggcagctata gtagtctttc ccctccaactttaactccgg 3780 tgatgccagt aaatgctggt ggtaaagttc aagaatcaca gaagcctccaactctaatac 3840 ccgaaccaaa agactcccag gcaaatttta agagttcttc agaacagagtttgacggaga 3900 tgtggagacc taataataac ctcagcaaag agaaaactga atggcatgtggagaaaagca 3960 gcggaaagtt acaggctgct atggcatctg tcattgtgcg tccatcttctagtacaaaaa 4020 ctgatagtat gccagcaatg cagttagctt ctaaagatcg agttagtgaaagatcttcag 4080 ctggggcaca taaaacagat tgcctcaaac tagcagaagc cggagaaactggaagaatca 4140 ttttgccaaa tgtgaattca gacagtgttc acacaaaatc tgaaaaaaactttcaggctg 4200 tctcacaggg cagtgttccc agttcagtca tgtctgctgt aaatacgatgtgtaatacca 4260 aaacggatgt aatcacatct gctgccgata ctaccagtgt ttccagctggggtggttcag 4320 aagtaatttc ctctttatca aataccattt tggcctctac atcatcagaatgtgtatctt 4380 caaaaagtgt cagtcagcca gtggctcaaa aacaagaatg caaggtcagcaccacagcac 4440 cagttacatt agccagtagt aagacaggaa gtgttgttca acccagttctgggttctcag 4500 gcacaactga ttttatccat ttaaaaaagc acaaggcagc attggctgcagctcagtata 4560 aaagtagtaa tgccagtgag actgaaccta atgctataaa aaatcagacactttcagcct 4620 cccttcctct ggatagcact gtaatctgta gtacaattaa caaagcaaactctgtaggaa 4680 atgggcaagc ttcccagaca agtcaaccaa actaccatac taaactgaaaaaggcctggc 4740 tcaccagaca ctcagaagaa gataaaaata ctaataaaat ggaaaattcagggaattctg 4800 tatcagaaat tattaagcca tgttctgtca acttaatagc ctctacatctagtgatatac 4860 aaaatagtgt agatagtaag atcatagttg ataaatatgt aaaagatgataaagtcaaca 4920 ggagaaaagc caaaagaact tatgaatctg gctctgaaag tggagactcagatgaaagtg 4980 aaagcaagtc agagcaaagg actaaaagac aacctaagcc aacttacaaaaagaagcaaa 5040 atgatttgca aaagagaaaa ggtgaaatag aagaagattt gaaacccaatggagttctca 5100 gcaggagtgc caaagaaaga agtaaactga agttgcaaag caacagtaatactggcattc 5160 ctcgttcagt attgaaagat tggcgtaaag tcaagaagct gaagcaaactggggaatcct 5220 ttttacagga tgactcctgc tgtgagatag ggcctaattt acaaaagtgtcgagaatgta 5280 gacttattcg cagtaaaaaa ggagaagaac cagctcactc accagtattttgtagatttt 5340 actactttag acggttgtca tttagtaaaa acggagtagt tagaatagatggtttctctt 5400 ctcctgacca atatgatgat gaagctatga gtttgtggac acatgaaaattttgaagatg 5460 atgaactaga tatagagact tctaaatata tcttggatat aataggtgataagttctgtc 5520 aattagtaac atctgaaaaa acagctttgt cctgggtgaa aaaggatgccaaaattgcct 5580 ggaaaagagc agtgagagga gtccgggaga tgtgtgatgc atgtgaagcaacattgttta 5640 acattcactg ggtctgccaa aaatgtggat ttgtggtctg cttagattgttacaaggcaa 5700 aggaaaggaa gagttctaga gataaagaac tatatgcttg gatgaagtgtgtgaagggac 5760 agcctcatga tcacaaacat ttaatgccaa cccaaattat acctggttctgttttgacag 5820 atcttctaga tgccatgcac actcttaggg aaaaatatgg tattaaatcccattgtcatt 5880 gtactaacaa acagaattta caagttggaa attttcctac aatgaatggtgtatctcaag 5940 ttttacagaa tgttcttaat cacagtaata aaatttctct gtgcatgcctgagtctcagc 6000 agcaaaatac tcctccgaag tctgagaaaa atggtggcag cagcccagagagtgatgtag 6060 gcacagataa caagttaact cctccagaat cccagtcacc actgcactggttagcagatc 6120 ttgcagagca aaaagccaga gaggaaaaaa aagaaaacaa agaacttacccttgaaaacc 6180 aaattaaaga agaaagagaa caagacaact ctgaatctcc aaatggcagaacatcacctc 6240 ttgtgtccca gaataatgaa caaggctcaa ccttacggga tttgctgactacaacagctg 6300 gaaagctacg tgtggggtct acagatgctg gcattgcctt tgccccagtatattcaatgg 6360 gagccccaag tagcaaaagt ggacggacta tgcctaacat tcttgatgacataattgctt 6420 cagttgttga aaacaaaatt ccaccaagta aaacctccaa gataaatgtaaaaccagagc 6480 ttaaagaaga gcctgaagaa agcataatat ctgcagtgga tgaaaataataaattataca 6540 gtgatatacc acattcttgg atctgtgaga agcatatttt atggcttaaggattataaga 6600 atagcagtaa ttggaagctt ttcaaagaat gttggaaaca aggacagcctgcagtggttt 6660 ctggtgtgca taagaaaatg aacattagcc tatggaaggc ggaatcaattagtcttgatt 6720 ttggagacca ccaagctgat ctcctgaact gcaaagatag catcatttcaaatgccaatg 6780 ttaaggaatt ctgggatggt tttgaagaag tttcaaaacg gcagaaaaacaagagtggag 6840 aaacagttgt tttaaaattg aaagactggc cttcaggaga agacttcaagactatgatgc 6900 cagcaagata cgaagatctt ttaaaaagtc tgccattgcc agaatattgtaatccagaag 6960 gaaaattcaa tttggcctct catttgccag gattttttgt acgtcctgatctaggaccca 7020 ggttgtgcag tgcctatggt gtagttgctg ctaaagatca tgatataggaacaacaaatc 7080 tccatattga agtttctgat gttgtaaata tactagttta tgttggcatagcaaaaggaa 7140 atggcattct ctcaaaagca ggaattctca agaaatttga ggaagaagatttggatgaca 7200 ttttaaggaa aagattgaag gactcaagtg aaatacctgg tgctctgtggcatatttatg 7260 ctgggaaaga tgttgacaag ataagggaat ttcttcaaaa gatttcaaaagaacaaggcc 7320 ttgaagttct accagaacat gatccaatac gtgaccaaag ttggtatgtgaacaaaaagc 7380 tccgtcaaag gctgcttgaa gaatatggag tcagaacctg tactcttattcagttccttg 7440 gtgatgctat tgttttgcca gcgggagcac ttcatcaggt tcagaattttcacagctgta 7500 ttcaggtaac tgaagatttt gtgtctccag aacatcttgt agagtcatttcatttaacac 7560 aggaactgag acttttgaag gaagaaatca attatgatga taaactacaggttaaaaata 7620 ttttgtatca tgcagtcaaa gaaatggtga gagccttgaa gatacacgaggatgaagtag 7680 aggatatgga agaaaattaa gtgtgatcca gtttgatatt tttaggttgttgaactggga 7740 ttacttaacc ttgaatgatg atatgtatgc acactgactt taagcttcataaaaccatca 7800 gtgccaagaa attctctttg tagtaattac ttgttactga caccacagcagtatagcata 7860 tgtcacagct cctgtgattc aatgttataa aacaagcaga attttaaaagcagcactata 7920 tagctgtttt gtattatagt gtatatgatg tttgtgaaaa tgccagatttaaaatgatgt 7980 atttattttt ggtaaaaaat aaaaaattct atgctatatt gttgatcaagtgtaaatgtg 8040 accttgtaca gtttactaaa attactgata tttttcacta cattgagacagttactgtga 8100 gaataggaca caaacaccag ctattgcctg catctgggaa attgctgaatcgcacagcag 8160 tcatgtcata atcagaaaat tactgccaaa taattgtaaa atttgtaaagtataaagtat 8220 ataaagtaga tactaaatac agacacttca atattttgtt gaagctattgactgtacaat 8280 taaacatttt caaaaggtgt aatttattta aaattgtctc attttggtaaaatttatgtg 8340 aacttttaaa gctaaatatt aaacttaata tgctatgtaa atatatacatatatacattt 8400 aatgatgtat ttttttaaaa cattggcttg cttttgttaa agtgcaagtgttacatatgg 8460 ctttgtacat taaagttgaa aggggtttta cattttccat taaaaggactttatc 8515 8 8516 DNA Homo sapiens 8 ttgcggggag ggcccgaggt cgctgtgtccggggcagagc ggccggttcg tcccgagtct 60 gcgctctttc gggtccgctg ctgtgtcccggtccgctctc ctcagcatga gcggccggta 120 ggagtgaggt ttcatcggtt ccctcgcactggaggaggca gcggccgctt cggcagcgac 180 agctatggcg gtagagacgc gggcagagctggtgggtaag cggttcctgt gtgtggcggt 240 cggcgacgag gcacgttcgg agcgctgggagagcggacgc ggctggcgaa gctggcgagc 300 gggggtcatc cgagccgtgt cacacagggacagccgcaat ccggacctgg cggtgtatgt 360 ggaatttgat gatcttgaat gggataaacgagagtgggtt aaagtttatg aagatttttc 420 aactttcttg gtggaatacc acttaatctgggccaaaagg aatgacccta gccagactca 480 gggatcaaag agcaaacaga ttcagtggcctgcattgact ttcaaacctc tggttgaaag 540 aaatataccc agttcagtca ctgcagtagaattccttgta gataagcaac tggatttttt 600 aactgaagat agtgcctttc agccctaccaggtccttatt ccttaaatgg atacagagtg 660 agagtatata gacaagactc tgccacccagtggtttactg gcataattac tcatcatgat 720 ctcttcaccc gcaccatgat cgttatgaatgatcaggtac tagaaccaca gaatgtcgat 780 ccttctatgg ttcaaatgac ctttctagatgatgttgttc actctttgtt aaaaggtgaa 840 aatattggca ttacatcacg acgcaggtctcgtgccaatc aaaacgtcaa cgctgttcac 900 agccattata cacgtgccca agcaaatagtcccagaccag caatgaactc ccaagctgct 960 gtaccaaaac agaatacaca ccagcaacagcaacaaagaa gtatccgtcc aaataagagg 1020 aagggctcag atagcagtat accagatgaagagaagatga aggaggaaaa atatgattat 1080 atatcacgag gagaaaatcc taaaggtaaaaacaaacact tgatgaataa aagaaggaaa 1140 cctgaggagg atgaaaagaa actaaatatgaaaagacttc gaactgacaa tgtttcagac 1200 ttttctgaga gcagtgactc agaaaattcaaataagagaa taatagataa ttcctcagaa 1260 cagaagccag agaatgaatt gaaaaataaaaatacttcaa aaataaatgg agaagaagga 1320 aaaccccata ataatgagaa ggcaggagaagagaccctaa aaaatagcca gcctccctgg 1380 gatcaaatac aggaagataa aaaacatgaagaagcagaga agcggaagtc tgttgacact 1440 cagcttcaag aagatatgat tattcattcgtcagaacagt ccacagtttc tgatcataat 1500 tctaatgatt tacttcctca ggaatgcaatatggataaaa cacataccat ggaattgcta 1560 ccaaaggaga agtttgtatc cagaccacccacaccaaaat gtgttattga tattacaaat 1620 gacactaatt tagaaaaggt ggctcaggaaaactcaagta cctttggcct tcagacactt 1680 cagaaaatgg atcctaatgt tagtgattcaaaacactcta ttgcaaatgc aaaattcttg 1740 gaaacagcaa aaaaagattc tgaccagagctgggtcagtg atgtagttaa agtggatcta 1800 acccaatcaa gtgttacaaa tgcttcttcaggaaatgatc acttgaacat ggaaaaagag 1860 aagtatgtct cttacatttc tcctttaagtgcagtttctg tcatggaaga taagctgcat 1920 aagcgaagtc cacctccaga gactataaaatctaaactta atacttcagt agatactcac 1980 aagataaaat ccagcccatc acctgaagttgttaaaccca aaataactca ttctcctgat 2040 tctgtaaagt ctaaggccac ttatgtgaacagccaagcta ctggtgaaag aagattggca 2100 aataagatag aacatgagct atcaagatgcagttttcatc caattcctac tcgaagcagt 2160 acattagaaa ctacaaagag tcctcttatcattgataaaa atgagcattt tacagtttac 2220 agagatcctg cacttattgg gtcagaaacaggagctaatc atatttcacc tttcctaagc 2280 cagcatcctt ttcctcttca ctcctcatctcatagaacct gtttaaatcc aggtacccat 2340 catcctgcct taactcctgc accccatttactagccggat catctagtca aactccatta 2400 cctaccatta acactcatcc tctgactagtggtccacacc atgctgttca tcaccctcat 2460 ttacttccca ctgtgttacc tggagtgcctactgcctcct tacttggtgg ccacccacga 2520 ctagagagtg ctcatgccag cagcttgagccacttagcgc tagcacacca gcaacaacaa 2580 cagttgttac agcaccagtc acctcatcttcttggacaag cccatccttc tgcttcatat 2640 aatcagcttg gactttatcc aattatttggcagtatccaa atggaacaca tgcatactca 2700 ggacttggtt tgccttcttc taagtgggttcacccagaaa atgcagttaa tgctgaagct 2760 tcattaagga ggaattctcc cagtccttggctacatcagc ccacccctgt gacctcagca 2820 gatggtattg gattacttag tcacattcctgtcagacctt ccagtgcaga gcctcatcgg 2880 cctcttaaaa ttacagccca ttccagtccaccattgacaa aaactttagt agatcatcat 2940 aaggaagaat tagaaagaaa agcttttatggaaccattac ggtctgttgc atccacatca 3000 gccaaaaatg acctggatct aaataggtcacagactggaa aagattgtca cttacatagg 3060 cattttgtgg atccagtatt aaatcagttacagaggccac cccaggagac tggagagagg 3120 ttaaacaaat acaaagagga acaccgtcgaattcttcaag aaagtattga tgttgctccc 3180 tttacaacta aaatcaaggg acttgagggtgagagagaga attattccag agtggcatca 3240 tcatcttcca gtcctaaaag ccatatcatcaaacaagata tggatgtaga acgctcagta 3300 tcagatcttt ataaaatgaa gcactcagtgcctcagagtt taccccaaag taactatttc 3360 actacattgt ctaatagtgt ggtcaatgaaccaccaagat catacccatc caaagaagtt 3420 tcaaatattt acggtgataa acagagtaatgcccttgcag cggcagcagc taatcctcaa 3480 actctgactt catttataac atctctttcaaagcctccac ctttgattaa acaccaacca 3540 gaaagtgaag gtttagtagg caagataccagaacatcttc cacatcagat tgcatctcac 3600 tcagtaacaa ccttcagaaa tgattgtaggagtcctaccc atttgacagt ttcttctaca 3660 aatacactcc gcagtatgcc tgcattacatagagcaccag tatttcaccc accaatccat 3720 cacagcctgg aaagaaagga aggcagctatagtagtcttt cccctccaac tttaactccg 3780 gtgatgccag taaatgctgg tggtaaagttcaagaatcac agaagcctcc aactctaata 3840 cccgaaccaa aagactccca ggcaaattttaagagttctt cagaacagag tttgacggag 3900 atgtggagac ctaataataa cctcagcaaagagaaaactg aatggcatgt ggagaaaagc 3960 agcggaaagt tacaggctgc tatggcatctgtcattgtgc gtccatcttc tagtacaaaa 4020 actgatagta tgccagcaat gcagttagcttctaaagatc gagttagtga aagatcttca 4080 gctggggcac ataaaacaga ttgcctcaaactagcagaag ccggagaaac tggaagaatc 4140 attttgccaa atgtgaattc agacagtgttcacacaaaat ctgaaaaaaa ctttcaggct 4200 gtctcacagg gcagtgttcc cagttcagtcatgtctgctg taaatacgat gtgtaatacc 4260 aaaacggatg taatcacatc tgctgccgatactaccagtg tttccagctg gggtggttca 4320 gaagtaattt cctctttatc aaataccattttggcctcta catcatcaga atgtgtatct 4380 tcaaaaagtg tcagtcagcc agtggctcaaaaacaagaat gcaaggtcag caccacagca 4440 ccagttacat tagccagtag taagacaggaagtgttgttc aacccagttc tgggttctca 4500 ggcacaactg attttatcca tttaaaaaagcacaaggcag cattggctgc agctcagtat 4560 aaaagtagta atgccagtga gactgaacctaatgctataa aaaatcagac actttcagcc 4620 tcccttcctc tggatagcac tgtaatctgtagtacaatta acaaagcaaa ctctgtagga 4680 aatgggcaag cttcccagac aagtcaaccaaactaccata ctaaactgaa aaaggcctgg 4740 ctcaccagac actcagaaga agataaaaatactaataaaa tggaaaattc agggaattct 4800 gtatcagaaa ttattaagcc atgttctgtcaacttaatag cctctacatc tagtgatata 4860 caaaatagtg tagatagtaa gatcatagttgataaatatg taaaagatga taaagtcaac 4920 aggagaaaag ccaaaagaac ttatgaatctggctctgaaa gtggagactc agatgaaagt 4980 gaaagcaagt cagagcaaag gactaaaagacaacctaagc caacttacaa aaagaagcaa 5040 aatgatttgc aaaagagaaa aggtgaaatagaagaagatt tgaaacccaa tggagttctc 5100 agcaggagtg ccaaagaaag aagtaaactgaagttgcaaa gcaacagtaa tactggcatt 5160 cctcgttcag tattgaaaga ttggcgtaaagtcaagaagc tgaagcaaac tggggaatcc 5220 tttttacagg atgactcctg ctgtgagatagggcctaatt tacaaaagtg tcgagaatgt 5280 agacttattc gcagtaaaaa aggagaagaaccagctcact caccagtatt ttgtagattt 5340 tactacttta gacggttgtc atttagtaaaaacggagtag ttagaataga tggtttctct 5400 tctcctgacc aatatgatga tgaagctatgagtttgtgga cacatgaaaa ttttgaagat 5460 gatgaactag atatagagac ttctaaatatatcttggata taataggtga taagttctgt 5520 caattagtaa catctgaaaa aacagctttgtcctgggtga aaaaggatgc caaaattgcc 5580 tggaaaagag cagtgagagg agtccgggagatgtgtgatg catgtgaagc aacattgttt 5640 aacattcact gggtctgcca aaaatgtggatttgtggtct gcttagattg ttacaaggca 5700 aaggaaagga agagttctag agataaagaactatatgctt ggatgaagtg tgtgaaggga 5760 cagcctcatg atcacaaaca tttaatgccaacccaaatta tacctggttc tgttttgaca 5820 gatcttctag atgccatgca cactcttagggaaaaatatg gtattaaatc ccattgtcat 5880 tgtactaaca aacagaattt acaagttggaaattttccta caatgaatgg tgtatctcaa 5940 gttttacaga atgttcttaa tcacagtaataaaatttctc tgtgcatgcc tgagtctcag 6000 cagcaaaata ctcctccgaa gtctgagaaaaatggtggca gcagcccaga gagtgatgta 6060 ggcacagata acaagttaac tcctccagaatcccagtcac cactgcactg gttagcagat 6120 cttgcagagc aaaaagccag agaggaaaaaaaagaaaaca aagaacttac ccttgaaaac 6180 caaattaaag aagaaagaga acaagacaactctgaatctc caaatggcag aacatcacct 6240 cttgtgtccc agaataatga acaaggctcaaccttacggg atttgctgac tacaacagct 6300 ggaaagctac gtgtggggtc tacagatgctggcattgcct ttgccccagt atattcaatg 6360 ggagccccaa gtagcaaaag tggacggactatgcctaaca ttcttgatga cataattgct 6420 tcagttgttg aaaacaaaat tccaccaagtaaaacctcca agataaatgt aaaaccagag 6480 cttaaagaag agcctgaaga aagcataatatctgcagtgg atgaaaataa taaattatac 6540 agtgatatac cacattcttg gatctgtgagaagcatattt tatggcttaa ggattataag 6600 aatagcagta attggaagct tttcaaagaatgttggaaac aaggacagcc tgcagtggtt 6660 tctggtgtgc ataagaaaat gaacattagcctatggaagg cggaatcaat tagtcttgat 6720 tttggagacc accaagctga tctcctgaactgcaaagata gcatcatttc aaatgccaat 6780 gttaaggaat tctgggatgg ttttgaagaagtttcaaaac ggcagaaaaa caagagtgga 6840 gaaacagttg ttttaaaatt gaaagactggccttcaggag aagacttcaa gactatgatg 6900 ccagcaagat acgaagatct tttaaaaagtctgccattgc cagaatattg taatccagaa 6960 ggaaaattca atttggcctc tcatttgccaggattttttg tacgtcctga tctaggaccc 7020 aggttgtgca gtgcctatgg tgtagttgctgctaaagatc atgatatagg aacaacaaat 7080 ctccatattg aagtttctga tgttgtaaatatactagttt atgttggcat agcaaaagga 7140 aatggcattc tctcaaaagc aggaattctcaagaaatttg aggaagaaga tttggatgac 7200 attttaagga aaagattgaa ggactcaagtgaaatacctg gtgctctgtg gcatatttat 7260 gctgggaaag atgttgacaa gataagggaatttcttcaaa agatttcaaa agaacaaggc 7320 cttgaagttc taccagaaca tgatccaatacgtgaccaaa gttggtatgt gaacaaaaag 7380 ctccgtcaaa ggctgcttga agaatatggagtcagaacct gtactcttat tcagttcctt 7440 ggtgatgcta ttgttttgcc agcgggagcacttcatcagg ttcagaattt tcacagctgt 7500 attcaggtaa ctgaagattt tgtgtctccagaacatcttg tagagtcatt tcatttaaca 7560 caggaactga gacttttgaa ggaagaaatcaattatgatg ataaactaca ggttaaaaat 7620 attttgtatc atgcagtcaa agaaatggtgagagccttga agatacacga ggatgaagta 7680 gaggatatgg aagaaaatta agtgtgatccagtttgatat ttttaggttg ttgaactggg 7740 attacttaac cttgaatgat gatatgtatgcacactgact ttaagcttca taaaaccatc 7800 agtgccaaga aattctcttt gtagtaattacttgttactg acaccacagc agtatagcat 7860 atgtcacagc tcctgtgatt caatgttataaaacaagcag aattttaaaa gcagcactat 7920 atagctgttt tgtattatag tgtatatgatgtttgtgaaa atgccagatt taaaatgatg 7980 tatttatttt tggtaaaaaa taaaaaattctatgctatat tgttgatcaa gtgtaaatgt 8040 gaccttgtac agtttactaa aattactgatatttttcact acattgagac agttactgtg 8100 agaataggac acaaacacca gctattgcctgcatctggga aattgctgaa tcgcacagca 8160 gtcatgtcat aatcagaaaa ttactgccaaataattgtaa aatttgtaaa gtataaagta 8220 tataaagtag atactaaata cagacacttcaatattttgt tgaagctatt gactgtacaa 8280 ttaaacattt tcaaaaggtg taatttatttaaaattgtct cattttggta aaatttatgt 8340 gaacttttaa agctaaatat taaacttaatatgctatgta aatatataca tatatacatt 8400 taatgatgta tttttttaaa acattggcttgcttttgtta aagtgcaagt gttacatatg 8460 gctttgtaca ttaaagttga aaggggttttacattttcca ttaaaaggac tttatc 8516 9 8479 DNA Homo sapiens 9 ttgcggggagggcccgaggt cgctgtgtcc ggggcagagc ggccggttcg tcccgagtct 60 gcgctctttcgggtccgctg ctgtgtcccg gtccgctctc ctcagcatga gcggccggag 120 gcagcggccgcttcggcagc gacagctatg gcggtagaga cgcgggcaga gctggtgggt 180 aagcggttcctgtgtgtggc ggtcggcgac gaggcacgtt cggagcgctg ggagagcgga 240 cgcggctggcgaagctggcg agcgggggtc atccgagccg tgtcacacag ggacagccgc 300 aatccggacctggcggtgta tgtggaattt gatgatcttg aatgggataa acgagagtgg 360 gttaaagtttatgaagattt ttcaactttc ttggtggaat accacttaat ctgggccaaa 420 aggaatgaccctagccagac tcagggatca aagagcaaac agattcagtg gcctgcattg 480 actttcaaacctctggttga aagaaatata cccagttcag tcactgcagt agaattcctt 540 gtagataagcaactggattt tttaactgaa gatagtgcct ttcagcccta ccaggtcctt 600 attccttaaatggatacaga gtgagagtat atagacaaga ctctgccacc cagtggttta 660 ctggcataattactcatcat gatctcttca cccgcaccat gatcgttatg aatgatcagg 720 tactagaaccacagaatgtc gatccttcta tggttcaaat gacctttcta gatgatgttg 780 ttcactctttgttaaaaggt gaaaatattg gcattacatc acgacgcagg tctcgtgcca 840 atcaaaacgtcaacgctgtt cacagccatt atacacgtgc ccaagcaaat agtcccagac 900 cagcaatgaactcccaagct gctgtaccaa aacagaatac acaccagcaa cagcaacaaa 960 gaagtatccgtccaaataag aggaagggct cagatagcag tataccagat gaagagaaga 1020 tgaaggaggaaaaatatgat tatatatcac gaggagaaaa tcctaaaggt aaaaacaaac 1080 acttgatgaataaaagaagg aaacctgagg aggatgaaaa gaaactaaat atgaaaagac 1140 ttcgaactgacaatgtttca gacttttctg agagcagtga ctcagaaaat tcaaataaga 1200 gaataatagataattcctca gaacagaagc cagagaatga attgaaaaat aaaaatactt 1260 caaaaataaatggagaagaa ggaaaacccc ataataatga gaaggcagga gaagagaccc 1320 taaaaaatagccagcctccc tgggatcaaa tacaggaaga taaaaaacat gaagaagcag 1380 agaagcggaagtctgttgac actcagcttc aagaagatat gattattcat tcgtcagaac 1440 agtccacagtttctgatcat aattctaatg atttacttcc tcaggaatgc aatatggata 1500 aaacacataccatggaattg ctaccaaagg agaagtttgt atccagacca cccacaccaa 1560 aatgtgttattgatattaca aatgacacta atttagaaaa ggtggctcag gaaaactcaa 1620 gtacctttggccttcagaca cttcagaaaa tggatcctaa tgttagtgat tcaaaacact 1680 ctattgcaaatgcaaaattc ttggaaacag caaaaaaaga ttctgaccag agctgggtca 1740 gtgatgtagttaaagtggat ctaacccaat caagtgttac aaatgcttct tcaggaaatg 1800 atcacttgaacatggaaaaa gagaagtatg tctcttacat ttctccttta agtgcagttt 1860 ctgtcatggaagataagctg cataagcgaa gtccacctcc agagactata aaatctaaac 1920 ttaatacttcagtagatact cacaagataa aatccagccc atcacctgaa gttgttaaac 1980 ccaaaataactcattctcct gattctgtaa agtctaaggc cacttatgtg aacagccaag 2040 ctactggtgaaagaagattg gcaaataaga tagaacatga gctatcaaga tgcagttttc 2100 atccaattcctactcgaagc agtacattag aaactacaaa gagtcctctt atcattgata 2160 aaaatgagcattttacagtt tacagagatc ctgcacttat tgggtcagaa acaggagcta 2220 atcatatttcacctttccta agccagcatc cttttcctct tcactcctca tctcatagaa 2280 cctgtttaaatccaggtacc catcatcctg ccttaactcc tgcaccccat ttactagccg 2340 gatcatctagtcaaactcca ttacctacca ttaacactca tcctctgact agtggtccac 2400 accatgctgttcatcaccct catttacttc ccactgtgtt acctggagtg cctactgcct 2460 ccttacttggtggccaccca cgactagaga gtgctcatgc cagcagcttg agccacttag 2520 cgctagcacaccagcaacaa caacagttgt tacagcacca gtcacctcat cttcttggac 2580 aagcccatccttctgcttca tataatcagc ttggacttta tccaattatt tggcagtatc 2640 caaatggaacacatgcatac tcaggacttg gtttgccttc ttctaagtgg gttcacccag 2700 aaaatgcagttaatgctgaa gcttcattaa ggaggaattc tcccagtcct tggctacatc 2760 agcccacccctgtgacctca gcagatggta ttggattact tagtcacatt cctgtcagac 2820 cttccagtgcagagcctcat cggcctctta aaattacagc ccattccagt ccaccattga 2880 caaaaactttagtagatcat cataaggaag aattagaaag aaaagctttt atggaaccat 2940 tacggtctgttgcatccaca tcagccaaaa atgacctgga tctaaatagg tcacagactg 3000 gaaaagattgtcacttacat aggcattttg tggatccagt attaaatcag ttacagaggc 3060 caccccaggagactggagag aggttaaaca aatacaaaga ggaacaccgt cgaattcttc 3120 aagaaagtattgatgttgct ccctttacaa ctaaaatcaa gggacttgag ggtgagagag 3180 agaattattccagagtggca tcatcatctt ccagtcctaa aagccatatc atcaaacaag 3240 atatggatgtagaacgctca gtatcagatc tttataaaat gaagcactca gtgcctcaga 3300 gtttaccccaaagtaactat ttcactacat tgtctaatag tgtggtcaat gaaccaccaa 3360 gatcatacccatccaaagaa gtttcaaata tttacggtga taaacagagt aatgcccttg 3420 cagcggcagcagctaatcct caaactctga cttcatttat aacatctctt tcaaagcctc 3480 cacctttgattaaacaccaa ccagaaagtg aaggtttagt aggcaagata ccagaacatc 3540 ttccacatcagattgcatct cactcagtaa caaccttcag aaatgattgt aggagtccta 3600 cccatttgacagtttcttct acaaatacac tccgcagtat gcctgcatta catagagcac 3660 cagtatttcacccaccaatc catcacagcc tggaaagaaa ggaaggcagc tatagtagtc 3720 tttcccctccaactttaact ccggtgatgc cagtaaatgc tggtggtaaa gttcaagaat 3780 cacagaagcctccaactcta atacccgaac caaaagactc ccaggcaaat tttaagagtt 3840 cttcagaacagagtttgacg gagatgtgga gacctaataa taacctcagc aaagagaaaa 3900 ctgaatggcatgtggagaaa agcagcggaa agttacaggc tgctatggca tctgtcattg 3960 tgcgtccatcttctagtaca aaaactgata gtatgccagc aatgcagtta gcttctaaag 4020 atcgagttagtgaaagatct tcagctgggg cacataaaac agattgcctc aaactagcag 4080 aagccggagaaactggaaga atcattttgc caaatgtgaa ttcagacagt gttcacacaa 4140 aatctgaaaaaaactttcag gctgtctcac agggcagtgt tcccagttca gtcatgtctg 4200 ctgtaaatacgatgtgtaat accaaaacgg atgtaatcac atctgctgcc gatactacca 4260 gtgtttccagctggggtggt tcagaagtaa tttcctcttt atcaaatacc attttggcct 4320 ctacatcatcagaatgtgta tcttcaaaaa gtgtcagtca gccagtggct caaaaacaag 4380 aatgcaaggtcagcaccaca gcaccagtta cattagccag tagtaagaca ggaagtgttg 4440 ttcaacccagttctgggttc tcaggcacaa ctgattttat ccatttaaaa aagcacaagg 4500 cagcattggctgcagctcag tataaaagta gtaatgccag tgagactgaa cctaatgcta 4560 taaaaaatcagacactttca gcctcccttc ctctggatag cactgtaatc tgtagtacaa 4620 ttaacaaagcaaactctgta ggaaatgggc aagcttccca gacaagtcaa ccaaactacc 4680 atactaaactgaaaaaggcc tggctcacca gacactcaga agaagataaa aatactaata 4740 aaatggaaaattcagggaat tctgtatcag aaattattaa gccatgttct gtcaacttaa 4800 tagcctctacatctagtgat atacaaaata gtgtagatag taagatcata gttgataaat 4860 atgtaaaagatgataaagtc aacaggagaa aagccaaaag aacttatgaa tctggctctg 4920 aaagtggagactcagatgaa agtgaaagca agtcagagca aaggactaaa agacaaccta 4980 agccaacttacaaaaagaag caaaatgatt tgcaaaagag aaaaggtgaa atagaagaag 5040 atttgaaacccaatggagtt ctcagcagga gtgccaaaga aagaagtaaa ctgaagttgc 5100 aaagcaacagtaatactggc attcctcgtt cagtattgaa agattggcgt aaagtcaaga 5160 agctgaagcaaactggggaa tcctttttac aggatgactc ctgctgtgag atagggccta 5220 atttacaaaagtgtcgagaa tgtagactta ttcgcagtaa aaaaggagaa gaaccagctc 5280 actcaccagtattttgtaga ttttactact ttagacggtt gtcatttagt aaaaacggag 5340 tagttagaatagatggtttc tcttctcctg accaatatga tgatgaagct atgagtttgt 5400 ggacacatgaaaattttgaa gatgatgaac tagatataga gacttctaaa tatatcttgg 5460 atataataggtgataagttc tgtcaattag taacatctga aaaaacagct ttgtcctggg 5520 tgaaaaaggatgccaaaatt gcctggaaaa gagcagtgag aggagtccgg gagatgtgtg 5580 atgcatgtgaagcaacattg tttaacattc actgggtctg ccaaaaatgt ggatttgtgg 5640 tctgcttagattgttacaag gcaaaggaaa ggaagagttc tagagataaa gaactatatg 5700 cttggatgaagtgtgtgaag ggacagcctc atgatcacaa acatttaatg ccaacccaaa 5760 ttatacctggttctgttttg acagatcttc tagatgccat gcacactctt agggaaaaat 5820 atggtattaaatcccattgt cattgtacta acaaacagaa tttacaagtt ggaaattttc 5880 ctacaatgaatggtgtatct caagttttac agaatgttct taatcacagt aataaaattt 5940 ctctgtgcatgcctgagtct cagcagcaaa atactcctcc gaagtctgag aaaaatggtg 6000 gcagcagcccagagagtgat gtaggcacag ataacaagtt aactcctcca gaatcccagt 6060 caccactgcactggttagca gatcttgcag agcaaaaagc cagagaggaa aaaaaagaaa 6120 acaaagaacttacccttgaa aaccaaatta aagaagaaag agaacaagac aactctgaat 6180 ctccaaatggcagaacatca cctcttgtgt cccagaataa tgaacaaggc tcaaccttac 6240 gggatttgctgactacaaca gctggaaagc tacgtgtggg gtctacagat gctggcattg 6300 cctttgccccagtatattca atgggagccc caagtagcaa aagtggacgg actatgccta 6360 acattcttgatgacataatt gcttcagttg ttgaaaacaa aattccacca agtaaaacct 6420 ccaagataaatgtaaaacca gagcttaaag aagagcctga agaaagcata atatctgcag 6480 tggatgaaaataataaatta tacagtgata taccacattc ttggatctgt gagaagcata 6540 ttttatggcttaaggattat aagaatagca gtaattggaa gcttttcaaa gaatgttgga 6600 aacaaggacagcctgcagtg gtttctggtg tgcataagaa aatgaacatt agcctatgga 6660 aggcggaatcaattagtctt gattttggag accaccaagc tgatctcctg aactgcaaag 6720 atagcatcatttcaaatgcc aatgttaagg aattctggga tggttttgaa gaagtttcaa 6780 aacggcagaaaaacaagagt ggagaaacag ttgttttaaa attgaaagac tggccttcag 6840 gagaagacttcaagactatg atgccagcaa gatacgaaga tcttttaaaa agtctgccat 6900 tgccagaatattgtaatcca gaaggaaaat tcaatttggc ctctcatttg ccaggatttt 6960 ttgtacgtcctgatctagga cccaggttgt gcagtgccta tggtgtagtt gctgctaaag 7020 atcatgatataggaacaaca aatctccata ttgaagtttc tgatgttgta aatatactag 7080 tttatgttggcatagcaaaa ggaaatggca ttctctcaaa agcaggaatt ctcaagaaat 7140 ttgaggaagaagatttggat gacattttaa ggaaaagatt gaaggactca agtgaaatac 7200 ctggtgctctgtggcatatt tatgctggga aagatgttga caagataagg gaatttcttc 7260 aaaagatttcaaaagaacaa ggccttgaag ttctaccaga acatgatcca atacgtgacc 7320 aaagttggtatgtgaacaaa aagctccgtc aaaggctgct tgaagaatat ggagtcagaa 7380 cctgtactcttattcagttc cttggtgatg ctattgtttt gccagcggga gcacttcatc 7440 aggttcagaattttcacagc tgtattcagg taactgaaga ttttgtgtct ccagaacatc 7500 ttgtagagtcatttcattta acacaggaac tgagactttt gaaggaagaa atcaattatg 7560 atgataaactacaggttaaa aatattttgt atcatgcagt caaagaaatg gtgagagcct 7620 tgaagatacacgaggatgaa gtagaggata tggaagaaaa ttaagtgtga tccagtttga 7680 tatttttaggttgttgaact gggattactt aaccttgaat gatgatatgt atgcacactg 7740 actttaagcttcataaaacc atcagtgcca agaaattctc tttgtagtaa ttacttgtta 7800 ctgacaccacagcagtatag catatgtcac agctcctgtg attcaatgtt ataaaacaag 7860 cagaattttaaaagcagcac tatatagctg ttttgtatta tagtgtatat gatgtttgtg 7920 aaaatgccagatttaaaatg atgtatttat ttttggtaaa aaataaaaaa ttctatgcta 7980 tattgttgatcaagtgtaaa tgtgaccttg tacagtttac taaaattact gatatttttc 8040 actacattgagacagttact gtgagaatag gacacaaaca ccagctattg cctgcatctg 8100 ggaaattgctgaatcgcaca gcagtcatgt cataatcaga aaattactgc caaataattg 8160 taaaatttgtaaagtataaa gtatataaag tagatactaa atacagacac ttcaatattt 8220 tgttgaagctattgactgta caattaaaca ttttcaaaag gtgtaattta tttaaaattg 8280 tctcattttggtaaaattta tgtgaacttt taaagctaaa tattaaactt aatatgctat 8340 gtaaatatatacatatatac atttaatgat gtattttttt aaaacattgg cttgcttttg 8400 ttaaagtgcaagtgttacat atggctttgt acattaaagt tgaaaggggt tttacatttt 8460 ccattaaaaggactttatc 8479 10 8275 DNA Homo sapiens 10 cgagagtggg ttaaagtttatgaagatttt tcaactttct tggtggaata ccacttaatc 60 tgggccaaaa ggaatgaccctagccagact cagggatcaa agagcaaaca gattcagtgg 120 cctgcattgt tctgggatacatatgcggaa cgtgccggtt tgttacatag actttcaaac 180 ctctggttga aagaaatatacccagttcag tcactgcagt agaattcctt gtagataagc 240 aactggattt tttaactgaagatagtgcct ttcagcccta ccaggacgac atagacagcc 300 taaacccagt tctcagggacaacccgcagc ttcatgagga agtgaaagtc tgggtaaagg 360 aacaaaaggt tcaggagatttttatgcaag gtccttattc cttaaatgga tacagagtga 420 gagtatatag acaagactctgccacccagt ggtttactgg cataattact catcatgatc 480 tcttcacccg caccatgatcgttatgaatg atcaggtact agaaccacag aatgtcgatc 540 cttctatggt tcaaatgacctttctagatg atgttgttca ctctttgtta aaaggtgaaa 600 atattggcat tacatcacgacgcaggtctc gtgccaatca aaacgtcaac gctgttcaca 660 gccattatac acgtgcccaagcaaatagtc ccagaccagc aatgaactcc caagctgctg 720 taccaaaaca gaatacacaccagcaacagc aacaaagaag tatccgtcca aataagagga 780 agggctcaga tagcagtataccagatgaag agaagatgaa ggaggaaaaa tatgattata 840 tatcacgagg agaaaatcctaaaggtaaaa acaaacactt gatgaataaa agaaggaaac 900 ctgaggagga tgaaaagaaactaaatatga aaagacttcg aactgacaat gtttcagact 960 tttctgagag cagtgactcagaaaattcaa ataagagaat aatagataat tcctcagaac 1020 agaagccaga gaatgaattgaaaaataaaa atacttcaaa aataaatgga gaagaaggaa 1080 aaccccataa taatgagaaggcaggagaag agaccctaaa aaatagccag cctccctggg 1140 atcaaataca ggaagataaaaaacatgaag aagcagagaa gcggaagtct gttgacactc 1200 agcttcaaga agatatgattattcattcgt cagaacagtc cacagtttct gatcataatt 1260 ctaatgattt acttcctcaggaatgcaata tggataaaac acataccatg gaattgctac 1320 caaaggagaa gtttgtatccagaccaccca caccaaaatg tgttattgat attacaaatg 1380 acactaattt agaaaaggtggctcaggaaa actcaagtac ctttggcctt cagacacttc 1440 agaaaatgga tcctaatgttagtgattcaa aacactctat tgcaaatgca aaattcttgg 1500 aaacagcaaa aaaagattctgaccagagct gggtcagtga tgtagttaaa gtggatctaa 1560 cccaatcaag tgttacaaatgcttcttcag gaaatgatca cttgaacatg gaaaaagaga 1620 agtatgtctc ttacatttctcctttaagtg cagtttctgt catggaagat aagctgcata 1680 agcgaagtcc acctccagagactataaaat ctaaacttaa tacttcagta gatactcaca 1740 agataaaatc cagcccatcacctgaagttg ttaaacccaa aataactcat tctcctgatt 1800 ctgtaaagtc taaggccacttatgtgaaca gccaagctac tggtgaaaga agattggcaa 1860 ataagataga acatgagctatcaagatgca gttttcatcc aattcctact cgaagcagta 1920 cattagaaac tacaaagagtcctcttatca ttgataaaaa tgagcatttt acagtttaca 1980 gagatcctgc acttattgggtcagaaacag gagctaatca tatttcacct ttcctaagcc 2040 agcatccttt tcctcttcactcctcatctc atagaacctg tttaaatcca ggtacccatc 2100 atcctgcctt aactcctgcaccccatttac tagccggatc atctagtcaa actccattac 2160 ctaccattaa cactcatcctctgactagtg gtccacacca tgctgttcat caccctcatt 2220 tacttcccac tgtgttacctggagtgccta ctgcctcctt acttggtggc cacccacgac 2280 tagagagtgc tcatgccagcagcttgagcc acttagcgct agcacaccag caacaacaac 2340 agttgttaca gcaccagtcacctcatcttc ttggacaagc ccatccttct gcttcatata 2400 atcagcttgg actttatccaattatttggc agtatccaaa tggaacacat gcatactcag 2460 gacttggttt gccttcttctaagtgggttc acccagaaaa tgcagttaat gctgaagctt 2520 cattaaggag gaattctcccagtccttggc tacatcagcc cacccctgtg acctcagcag 2580 atggtattgg attacttagtcacattcctg tcagaccttc cagtgcagag cctcatcggc 2640 ctcttaaaat tacagcccattccagtccac cattgacaaa aactttagta gatcatcata 2700 aggaagaatt agaaagaaaagcttttatgg aaccattacg gtctgttgca tccacatcag 2760 ccaaaaatga cctggatctaaataggtcac agactggaaa agattgtcac ttacataggc 2820 attttgtgga tccagtattaaatcagttac agaggccacc ccaggagact ggagagaggt 2880 taaacaaata caaagaggaacaccgtcgaa ttcttcaaga aagtattgat gttgctccct 2940 ttacaactaa aatcaagggacttgagggtg agagagagaa ttattccaga gtggcatcat 3000 catcttccag tcctaaaagccatatcatca aacaagatat ggatgtagaa cgctcagtat 3060 cagatcttta taaaatgaagcactcagtgc ctcagagttt accccaaagt aactatttca 3120 ctacattgtc taatagtgtggtcaatgaac caccaagatc atacccatcc aaagaagttt 3180 caaatattta cggtgataaacagagtaatg cccttgcagc ggcagcagct aatcctcaaa 3240 ctctgacttc atttataacatctctttcaa agcctccacc tttgattaaa caccaaccag 3300 aaagtgaagg tttagtaggcaagataccag aacatcttcc acatcagatt gcatctcact 3360 cagtaacaac cttcagaaatgattgtagga gtcctaccca tttgacagtt tcttctacaa 3420 atacactccg cagtatgcctgcattacata gagcaccagt atttcaccca ccaatccatc 3480 acagcctgga aagaaaggaaggcagctata gtagtctttc ccctccaact ttaactccgg 3540 tgatgccagt aaatgctggtggtaaagttc aagaatcaca gaagcctcca actctaatac 3600 ccgaaccaaa agactcccaggcaaatttta agagttcttc agaacagagt ttgacggaga 3660 tgtggagacc taataataacctcagcaaag agaaaactga atggcatgtg gagaaaagca 3720 gcggaaagtt acaggctgctatggcatctg tcattgtgcg tccatcttct agtacaaaaa 3780 ctgatagtat gccagcaatgcagttagctt ctaaagatcg agttagtgaa agatcttcag 3840 ctggggcaca taaaacagattgcctcaaac tagcagaagc cggagaaact ggaagaatca 3900 ttttgccaaa tgtgaattcagacagtgttc acacaaaatc tgaaaaaaac tttcaggctg 3960 tctcacaggg cagtgttcccagttcagtca tgtctgctgt aaatacgatg tgtaatacca 4020 aaacggatgt aatcacatctgctgccgata ctaccagtgt ttccagctgg ggtggttcag 4080 aagtaatttc ctctttatcaaataccattt tggcctctac atcatcagaa tgtgtatctt 4140 caaaaagtgt cagtcagccagtggctcaaa aacaagaatg caaggtcagc accacagcac 4200 cagttacatt agccagtagtaagacaggaa gtgttgttca acccagttct gggttctcag 4260 gcacaactga ttttatccatttaaaaaagc acaaggcagc attggctgca gctcagtata 4320 aaagtagtaa tgccagtgagactgaaccta atgctataaa aaatcagaca ctttcagcct 4380 cccttcctct ggatagcactgtaatctgta gtacaattaa caaagcaaac tctgtaggaa 4440 atgggcaagc ttcccagacaagtcaaccaa actaccatac taaactgaaa aaggcctggc 4500 tcaccagaca ctcagaagaagataaaaata ctaataaaat ggaaaattca gggaattctg 4560 tatcagaaat tattaagccatgttctgtca acttaatagc ctctacatct agtgatatac 4620 aaaatagtgt agatagtaagatcatagttg ataaatatgt aaaagatgat aaagtcaaca 4680 ggagaaaagc caaaagaacttatgaatctg gctctgaaag tggagactca gatgaaagtg 4740 aaagcaagtc agagcaaaggactaaaagac aacctaagcc aacttacaaa aagaagcaaa 4800 atgatttgca aaagagaaaaggtgaaatag aagaagattt gaaacccaat ggagttctca 4860 gcaggagtgc caaagaaagaagtaaactga agttgcaaag caacagtaat actggcattc 4920 ctcgttcagt attgaaagattggcgtaaag tcaagaagct gaagcaaact ggggaatcct 4980 ttttacagga tgactcctgctgtgagatag ggcctaattt acaaaagtgt cgagaatgta 5040 gacttattcg cagtaaaaaaggagaagaac cagctcactc accagtattt tgtagatttt 5100 actactttag acggttgtcatttagtaaaa acggagtagt tagaatagat ggtttctctt 5160 ctcctgacca atatgatgatgaagctatga gtttgtggac acatgaaaat tttgaagatg 5220 atgaactaga tatagagacttctaaatata tcttggatat aataggtgat aagttctgtc 5280 aattagtaac atctgaaaaaacagctttgt cctgggtgaa aaaggatgcc aaaattgcct 5340 ggaaaagagc agtgagaggagtccgggaga tgtgtgatgc atgtgaagca acattgttta 5400 acattcactg ggtctgccaaaaatgtggat ttgtggtctg cttagattgt tacaaggcaa 5460 aggaaaggaa gagttctagagataaagaac tatatgcttg gatgaagtgt gtgaagggac 5520 agcctcatga tcacaaacatttaatgccaa cccaaattat acctggttct gttttgacag 5580 atcttctaga tgccatgcacactcttaggg aaaaatatgg tattaaatcc cattgtcatt 5640 gtactaacaa acagaatttacaagttggaa attttcctac aatgaatggt gtatctcaag 5700 ttttacagaa tgttcttaatcacagtaata aaatttctct gtgcatgcct gagtctcagc 5760 agcaaaatac tcctccgaagtctgagaaaa atggtggcag cagcccagag agtgatgtag 5820 gcacagataa caagttaactcctccagaat cccagtcacc actgcactgg ttagcagatc 5880 ttgcagagca aaaagccagagaggaaaaaa aagaaaacaa agaacttacc cttgaaaacc 5940 aaattaaaga agaaagagaacaagacaact ctgaatctcc aaatggcaga acatcacctc 6000 ttgtgtccca gaataatgaacaaggctcaa ccttacggga tttgctgact acaacagctg 6060 gaaagctacg tgtggggtctacagatgctg gcattgcctt tgccccagta tattcaatgg 6120 gagccccaag tagcaaaagtggacggacta tgcctaacat tcttgatgac ataattgctt 6180 cagttgttga aaacaaaattccaccaagta aaacctccaa gataaatgta aaaccagagc 6240 ttaaagaaga gcctgaagaaagcataatat ctgcagtgga tgaaaataat aaattataca 6300 gtgatatacc acattcttggatctgtgaga agcatatttt atggcttaag gattataaga 6360 atagcagtaa ttggaagcttttcaaagaat gttggaaaca aggacagcct gcagtggttt 6420 ctggtgtgca taagaaaatgaacattagcc tatggaaggc ggaatcaatt agtcttgatt 6480 ttggagacca ccaagctgatctcctgaact gcaaagatag catcatttca aatgccaatg 6540 ttaaggaatt ctgggatggttttgaagaag tttcaaaacg gcagaaaaac aagagtggag 6600 aaacagttgt tttaaaattgaaagactggc cttcaggaga agacttcaag actatgatgc 6660 cagcaagata cgaagatcttttaaaaagtc tgccattgcc agaatattgt aatccagaag 6720 gaaaattcaa tttggcctctcatttgccag gattttttgt acgtcctgat ctaggaccca 6780 ggttgtgcag tgcctatggtgtagttgctg ctaaagatca tgatatagga acaacaaatc 6840 tccatattga agtttctgatgttgtaaata tactagttta tgttggcata gcaaaaggaa 6900 atggcattct ctcaaaagcaggaattctca agaaatttga ggaagaagat ttggatgaca 6960 ttttaaggaa aagattgaaggactcaagtg aaatacctgg tgctctgtgg catatttatg 7020 ctgggaaaga tgttgacaagataagggaat ttcttcaaaa gatttcaaaa gaacaaggcc 7080 ttgaagttct accagaacatgatccaatac gtgaccaaag ttggtatgtg aacaaaaagc 7140 tccgtcaaag gctgcttgaagaatatggag tcagaacctg tactcttatt cagttccttg 7200 gtgatgctat tgttttgccagcgggagcac ttcatcaggt tcagaatttt cacagctgta 7260 ttcaggtaac tgaagattttgtgtctccag aacatcttgt agagtcattt catttaacac 7320 aggaactgag acttttgaaggaagaaatca attatgatga taaactacag gttaaaaata 7380 ttttgtatca tgcagtcaaagaaatggtga gagccttgaa gatacacgag gatgaagtag 7440 aggatatgga agaaaattaagtgtgatcca gtttgatatt tttaggttgt tgaactggga 7500 ttacttaacc ttgaatgatgatatgtatgc acactgactt taagcttcat aaaaccatca 7560 gtgccaagaa attctctttgtagtaattac ttgttactga caccacagca gtatagcata 7620 tgtcacagct cctgtgattcaatgttataa aacaagcaga attttaaaag cagcactata 7680 tagctgtttt gtattatagtgtatatgatg tttgtgaaaa tgccagattt aaaatgatgt 7740 atttattttt ggtaaaaaataaaaaattct atgctatatt gttgatcaag tgtaaatgtg 7800 accttgtaca gtttactaaaattactgata tttttcacta cattgagaca gttactgtga 7860 gaataggaca caaacaccagctattgcctg catctgggaa attgctgaat cgcacagcag 7920 tcatgtcata atcagaaaattactgccaaa taattgtaaa atttgtaaag tataaagtat 7980 ataaagtaga tactaaatacagacacttca atattttgtt gaagctattg actgtacaat 8040 taaacatttt caaaaggtgtaatttattta aaattgtctc attttggtaa aatttatgtg 8100 aacttttaaa gctaaatattaaacttaata tgctatgtaa atatatacat atatacattt 8160 aatgatgtat ttttttaaaacattggcttg cttttgttaa agtgcaagtg ttacatatgg 8220 ctttgtacat taaagttgaaaggggtttta cattttccat taaaaggact ttatc 8275 11 8234 DNA Homo sapiens 11cgagagtggg ttaaagttta tgaagatttt tcaactttct tggtggaata ccacttaatc 60tgggccaaaa ggaatgaccc tagccagact cagggatcaa agagcaaaca gattcagtgg 120cctgcattga ctttcaaacc tctggttgaa agaaatatac ccagttcagt cactgcagta 180gaattccttg tagataagca actggatttt ttaactgaag atagtgcctt tcagccctac 240caggacgaca tagacagcct aaacccagtt ctcagggaca acccgcagct tcatgaggaa 300gtgaaagtct gggtaaagga acaaaaggtt caggagattt ttatgcaagg tccttattcc 360ttaaatggat acagagtgag agtatataga caagactctg ccacccagtg gtttactggc 420ataattactc atcatgatct cttcacccgc accatgatcg ttatgaatga tcaggtacta 480gaaccacaga atgtcgatcc ttctatggtt caaatgacct ttctagatga tgttgttcac 540tctttgttaa aaggtgaaaa tattggcatt acatcacgac gcaggtctcg tgccaatcaa 600aacgtcaacg ctgttcacag ccattataca cgtgcccaag caaatagtcc cagaccagca 660atgaactccc aagctgctgt accaaaacag aatacacacc agcaacagca acaaagaagt 720atccgtccaa ataagaggaa gggctcagat agcagtatac cagatgaaga gaagatgaag 780gaggaaaaat atgattatat atcacgagga gaaaatccta aaggtaaaaa caaacacttg 840atgaataaaa gaaggaaacc tgaggaggat gaaaagaaac taaatatgaa aagacttcga 900actgacaatg tttcagactt ttctgagagc agtgactcag aaaattcaaa taagagaata 960atagataatt cctcagaaca gaagccagag aatgaattga aaaataaaaa tacttcaaaa 1020ataaatggag aagaaggaaa accccataat aatgagaagg caggagaaga gaccctaaaa 1080aatagccagc ctccctggga tcaaatacag gaagataaaa aacatgaaga agcagagaag 1140cggaagtctg ttgacactca gcttcaagaa gatatgatta ttcattcgtc agaacagtcc 1200acagtttctg atcataattc taatgattta cttcctcagg aatgcaatat ggataaaaca 1260cataccatgg aattgctacc aaaggagaag tttgtatcca gaccacccac accaaaatgt 1320gttattgata ttacaaatga cactaattta gaaaaggtgg ctcaggaaaa ctcaagtacc 1380tttggccttc agacacttca gaaaatggat cctaatgtta gtgattcaaa acactctatt 1440gcaaatgcaa aattcttgga aacagcaaaa aaagattctg accagagctg ggtcagtgat 1500gtagttaaag tggatctaac ccaatcaagt gttacaaatg cttcttcagg aaatgatcac 1560ttgaacatgg aaaaagagaa gtatgtctct tacatttctc ctttaagtgc agtttctgtc 1620atggaagata agctgcataa gcgaagtcca cctccagaga ctataaaatc taaacttaat 1680acttcagtag atactcacaa gataaaatcc agcccatcac ctgaagttgt taaacccaaa 1740ataactcatt ctcctgattc tgtaaagtct aaggccactt atgtgaacag ccaagctact 1800ggtgaaagaa gattggcaaa taagatagaa catgagctat caagatgcag ttttcatcca 1860attcctactc gaagcagtac attagaaact acaaagagtc ctcttatcat tgataaaaat 1920gagcatttta cagtttacag agatcctgca cttattgggt cagaaacagg agctaatcat 1980atttcacctt tcctaagcca gcatcctttt cctcttcact cctcatctca tagaacctgt 2040ttaaatccag gtacccatca tcctgcctta actcctgcac cccatttact agccggatca 2100tctagtcaaa ctccattacc taccattaac actcatcctc tgactagtgg tccacaccat 2160gctgttcatc accctcattt acttcccact gtgttacctg gagtgcctac tgcctcctta 2220cttggtggcc acccacgact agagagtgct catgccagca gcttgagcca cttagcgcta 2280gcacaccagc aacaacaaca gttgttacag caccagtcac ctcatcttct tggacaagcc 2340catccttctg cttcatataa tcagcttgga ctttatccaa ttatttggca gtatccaaat 2400ggaacacatg catactcagg acttggtttg ccttcttcta agtgggttca cccagaaaat 2460gcagttaatg ctgaagcttc attaaggagg aattctccca gtccttggct acatcagccc 2520acccctgtga cctcagcaga tggtattgga ttacttagtc acattcctgt cagaccttcc 2580agtgcagagc ctcatcggcc tcttaaaatt acagcccatt ccagtccacc attgacaaaa 2640actttagtag atcatcataa ggaagaatta gaaagaaaag cttttatgga accattacgg 2700tctgttgcat ccacatcagc caaaaatgac ctggatctaa ataggtcaca gactggaaaa 2760gattgtcact tacataggca ttttgtggat ccagtattaa atcagttaca gaggccaccc 2820caggagactg gagagaggtt aaacaaatac aaagaggaac accgtcgaat tcttcaagaa 2880agtattgatg ttgctccctt tacaactaaa atcaagggac ttgagggtga gagagagaat 2940tattccagag tggcatcatc atcttccagt cctaaaagcc atatcatcaa acaagatatg 3000gatgtagaac gctcagtatc agatctttat aaaatgaagc actcagtgcc tcagagttta 3060ccccaaagta actatttcac tacattgtct aatagtgtgg tcaatgaacc accaagatca 3120tacccatcca aagaagtttc aaatatttac ggtgataaac agagtaatgc ccttgcagcg 3180gcagcagcta atcctcaaac tctgacttca tttataacat ctctttcaaa gcctccacct 3240ttgattaaac accaaccaga aagtgaaggt ttagtaggca agataccaga acatcttcca 3300catcagattg catctcactc agtaacaacc ttcagaaatg attgtaggag tcctacccat 3360ttgacagttt cttctacaaa tacactccgc agtatgcctg cattacatag agcaccagta 3420tttcacccac caatccatca cagcctggaa agaaaggaag gcagctatag tagtctttcc 3480cctccaactt taactccggt gatgccagta aatgctggtg gtaaagttca agaatcacag 3540aagcctccaa ctctaatacc cgaaccaaaa gactcccagg caaattttaa gagttcttca 3600gaacagagtt tgacggagat gtggagacct aataataacc tcagcaaaga gaaaactgaa 3660tggcatgtgg agaaaagcag cggaaagtta caggctgcta tggcatctgt cattgtgcgt 3720ccatcttcta gtacaaaaac tgatagtatg ccagcaatgc agttagcttc taaagatcgw 3780gttagtgaaa gatcttcagc tggggcacat aaaacagatt gcctcaaact agcagaagcc 3840ggagaaactg gaagaatcat tttgccaaat gtgaattcag acagtgttca cacaaaatct 3900gaaaaaaact ttcaggctgt ctcacagggc agtgttccca gttcagtcat gtctgctgta 3960aatacgatgt gtaataccaa aacggatgta atcacatctg ctgccgatac taccagtgtt 4020tccagctggg gtggttcaga agtaatttcc tctttatcaa ataccatttt ggcctctaca 4080tcatcagaat gtgtatcttc aaaaagtgtc agtcagccag tggctcaaaa acaagaatgc 4140aaggtcagca ccacagcacc agttacatta gccagtagta agacaggaag tgttgttcaa 4200cccagttctg ggttctcagg cacaactgat tttatccatt taaaaaagca caaggcagca 4260ttggctgcag ctcagtataa aagtagtaat gccagtgaga ctgaacctaa tgctataaaa 4320aatcagacac tttcagcctc ccttcctctg gatagcactg taatctgtag tacaattaac 4380aaagcaaact ctgtaggaaa tgggcaagct tcccagacaa gtcaaccaaa ctaccatact 4440aaactgaaaa aggcctggct caccagacac tcagaagaag ataaaaatac taataaaatg 4500gaaaattcag ggaattctgt atcagaaatt attaagccat gttctgtcaa cttaatagcc 4560tctacatcta gtgatataca aaatagtgta gatagtaaga tcatagttga taaatatgta 4620aaagatgata aagtcaacag gagaaaagcc aaaagaactt atgaatctgg ctctgaaagt 4680ggagactcag atgaaagtga aagcaagtca gagcaaagga ctaaaagaca acctaagcca 4740acttacaaaa agaagcaaaa tgatttgcaa aagagaaaag gtgaaataga agaagatttg 4800aaacccaatg gagttctcag caggagtgcc aaagaaagaa gtaaactgaa gttgcaaagc 4860aacagtaata ctggcattcc tcgttcagta ttgaaagatt ggcgtaaagt caagaagctg 4920aagcaaactg gggaatcctt tttacaggat gactcctgct gtgagatagg gcctaattta 4980caaaagtgtc gagaatgtag acttattcgc agtaaaaaag gagaagaacc agctcactca 5040ccagtatttt gtagatttta ctactttaga cggttgtcat ttagtaaaaa cggagtagtt 5100agaatagatg gtttctcttc tcctgaccaa tatgatgatg aagctatgag tttgtggaca 5160catgaaaatt ttgaagatga tgaactagat atagagactt ctaaatatat cttggatata 5220ataggtgata agttctgtca attagtaaca tctgaaaaaa cagctttgtc ctgggtgaaa 5280aaggatgcca aaattgcctg gaaaagagca gtgagaggag tccgggagat gtgtgatgca 5340tgtgaagcaa cattgtttaa cattcactgg gtctgccaaa aatgtggatt tgtggtctgc 5400ttagattgtt acaaggcaaa ggaaaggaag agttctagag ataaagaact atatgcttgg 5460atgaagtgtg tgaagggaca gcctcatgat cacaaacatt taatgccaac ccaaattata 5520cctggttctg ttttgacaga tcttctagat gccatgcaca ctcttaggga aaaatatggt 5580attaaatccc attgtcattg tactaacaaa cagaatttac aagttggaaa ttttcctaca 5640atgaatggtg tatctcaagt tttacagaat gttcttaatc acagtaataa aatttctctg 5700tgcatgcctg agtctcagca gcaaaatact cctccgaagt ctgagaaaaa tggtggcagc 5760agcccagaga gtgatgtagg cacagataac aagttaactc ctccagaatc ccagtcacca 5820ctgcactggt tagcagatct tgcagagcaa aaagccagag aggaaaaaaa agaaaacaaa 5880gaacttaccc ttgaaaacca aattaaagaa gaaagagaac aagacaactc tgaatctcca 5940aatggcagaa catcacctct tgtgtcccag aataatgaac aaggctcaac cttacgggat 6000ttgctgacta caacagctgg aaagctacgt gtggggtcta cagatgctgg cattgccttt 6060gccccagtat attcaatggg agccccaagt agcaaaagtg gacggactat gcctaacatt 6120cttgatgaca taattgcttc agttgttgaa aacaaaattc caccaagtaa aacctccaag 6180ataaatgtaa aaccagagct taaagaagag cctgaagaaa gcataatatc tgcagtggat 6240gaaaataata aattatacag tgatatacca cattcttgga tctgtgagaa gcatatttta 6300tggcttaagg attataagaa tagcagtaat tggaagcttt tcaaagaatg ttggaaacaa 6360ggacagcctg cagtggtttc tggtgtgcat aagaaaatga acattagcct atggaaggcg 6420gaatcaatta gtcttgattt tggagaccac caagctgatc tcctgaactg caaagatagc 6480atcatttcaa atgccaatgt taaggaattc tgggatggtt ttgaagaagt ttcaaaacgg 6540cagaaaaaca agagtggaga aacagttgtt ttaaaattga aagactggcc ttcaggagaa 6600gacttcaaga ctatgatgcc agcaagatac gaagatcttt taaaaagtct gccattgcca 6660gaatattgta atccagaagg aaaattcaat ttggcctctc atttgccagg attttttgta 6720cgtcctgatc taggacccag gttgtgcagt gcctatggtg tagttgctgc taaagatcat 6780gatataggaa caacaaatct ccatattgaa gtttctgatg ttgtaaatat actagtttat 6840gttggcatag caaaaggaaa tggcattctc tcaaaagcag gaattctcaa gaaatttgag 6900gaagaagatt tggatgacat tttaaggaaa agattgaagg actcaagtga aatacctggt 6960gctctgtggc atatttatgc tgggaaagat gttgacaaga taagggaatt tcttcaaaag 7020atttcaaaag aacaaggcct tgaagttcta ccagaacatg atccaatacg tgaccaaagt 7080tggtatgtga acaaaaagct ccgtcaaagg ctgcttgaag aatatggagt cagaacctgt 7140actcttattc agttccttgg tgatgctatt gttttgccag cgggagcact tcatcaggtt 7200cagaattttc acagctgtat tcaggtaact gaagattttg tgtctccaga acatcttgta 7260gagtcatttc atttaacaca ggaactgaga cttttgaagg aagaaatcaa ttatgatgat 7320aaactacagg ttaaaaatat tttgtatcat gcagtcaaag aaatggtgag agccttgaag 7380atacacgagg atgaagtaga ggatatggaa gaaaattaag tgtgatccag tttgatattt 7440ttaggttgtt gaactgggat tacttaacct tgaatgatga tatgtatgca cactgacttt 7500aagcttcata aaaccatcag tgccaagaaa ttctctttgt agtaattact tgttactgac 7560accacagcag tatagcatat gtcacagctc ctgtgattca atgttataaa acaagcagaa 7620ttttaaaagc agcactatat agctgttttg tattatagtg tatatgatgt ttgtgaaaat 7680gccagattta aaatgatgta tttatttttg gtaaaaaata aaaaattcta tgctatattg 7740ttgatcaagt gtaaatgtga ccttgtacag tttactaaaa ttactgatat ttttcactac 7800attgagacag ttactgtgag aataggacac aaacaccagc tattgcctgc atctgggaaa 7860ttgctgaatc gcacagcagt catgtcataa tcagaaaatt actgccaaat aattgtaaaa 7920tttgtaaagt ataaagtata taaagtagat actaaataca gacacttcaa tattttgttg 7980aagctattga ctgtacaatt aaacattttc aaaaggtgta atttatttaa aattgtctca 8040ttttggtaaa atttatgtga acttttaaag ctaaatatta aacttaatat gctatgtaaa 8100tatatacata tatacattta atgatgtatt tttttaaaac attggcttgc ttttgttaaa 8160gtgcaagtgt tacatatggc tttgtacatt aaagttgaaa ggggttttac attttccatt 8220aaaaggactt tatc 8234 12 8128 DNA Homo sapiens 12 cgagagtggg ttaaagtttatgaagatttt tcaactttct tggtggaata ccacttaatc 60 tgggccaaaa ggaatgaccctagccagact cagggatcaa agagcaaaca gattcagtgg 120 cctgcattga ctttcaaacctctggttgaa agaaatatac ccagttcagt cactgcagta 180 gaattccttg tagataagcaactggatttt ttaactgaag atagtgcctt tcagccctac 240 caggtcctta ttccttaaatggatacagag tgagagtata tagacaagac tctgccaccc 300 agtggtttac tggcataattactcatcatg atctcttcac ccgcaccatg atcgttatga 360 atgatcaggt actagaaccacagaatgtcg atccttctat ggttcaaatg acctttctag 420 atgatgttgt tcactctttgttaaaaggtg aaaatattgg cattacatca cgacgcaggt 480 ctcgtgccaa tcaaaacgtcaacgctgttc acagccatta tacacgtgcc caagcaaata 540 gtcccagacc agcaatgaactcccaagctg ctgtaccaaa acagaataca caccagcaac 600 agcaacaaag aagtatccgtccaaataaga ggaagggctc agatagcagt ataccagatg 660 aagagaagat gaaggaggaaaaatatgatt atatatcacg aggagaaaat cctaaaggta 720 aaaacaaaca cttgatgaataaaagaagga aacctgagga ggatgaaaag aaactaaata 780 tgaaaagact tcgaactgacaatgtttcag acttttctga gagcagtgac tcagaaaatt 840 caaataagag aataatagataattcctcag aacagaagcc agagaatgaa ttgaaaaata 900 aaaatacttc aaaaataaatggagaagaag gaaaacccca taataatgag aaggcaggag 960 aagagaccct aaaaaatagccagcctccct gggatcaaat acaggaagat aaaaaacatg 1020 aagaagcaga gaagcggaagtctgttgaca ctcagcttca agaagatatg attattcatt 1080 cgtcagaaca gtccacagtttctgatcata attctaatga tttacttcct caggaatgca 1140 atatggataa aacacataccatggaattgc taccaaagga gaagtttgta tccagaccac 1200 ccacaccaaa atgtgttattgatattacaa atgacactaa tttagaaaag gtggctcagg 1260 aaaactcaag tacctttggccttcagacac ttcagaaaat ggatcctaat gttagtgatt 1320 caaaacactc tattgcaaatgcaaaattct tggaaacagc aaaaaaagat tctgaccaga 1380 gctgggtcag tgatgtagttaaagtggatc taacccaatc aagtgttaca aatgcttctt 1440 caggaaatga tcacttgaacatggaaaaag agaagtatgt ctcttacatt tctcctttaa 1500 gtgcagtttc tgtcatggaagataagctgc ataagcgaag tccacctcca gagactataa 1560 aatctaaact taatacttcagtagatactc acaagataaa atccagccca tcacctgaag 1620 ttgttaaacc caaaataactcattctcctg attctgtaaa gtctaaggcc acttatgtga 1680 acagccaagc tactggtgaaagaagattgg caaataagat agaacatgag ctatcaagat 1740 gcagttttca tccaattcctactcgaagca gtacattaga aactacaaag agtcctctta 1800 tcattgataa aaatgagcattttacagttt acagagatcc tgcacttatt gggtcagaaa 1860 caggagctaa tcatatttcacctttcctaa gccagcatcc ttttcctctt cactcctcat 1920 ctcatagaac ctgtttaaatccaggtaccc atcatcctgc cttaactcct gcaccccatt 1980 tactagccgg atcatctagtcaaactccat tacctaccat taacactcat cctctgacta 2040 gtggtccaca ccatgctgttcatcaccctc atttacttcc cactgtgtta cctggagtgc 2100 ctactgcctc cttacttggtggccacccac gactagagag tgctcatgcc agcagcttga 2160 gccacttagc gctagcacaccagcaacaac aacagttgtt acagcaccag tcacctcatc 2220 ttcttggaca agcccatccttctgcttcat ataatcagct tggactttat ccaattattt 2280 ggcagtatcc aaatggaacacatgcatact caggacttgg tttgccttct tctaagtggg 2340 ttcacccaga aaatgcagttaatgctgaag cttcattaag gaggaattct cccagtcctt 2400 ggctacatca gcccacccctgtgacctcag cagatggtat tggattactt agtcacattc 2460 ctgtcagacc ttccagtgcagagcctcatc ggcctcttaa aattacagcc cattccagtc 2520 caccattgac aaaaactttagtagatcatc ataaggaaga attagaaaga aaagctttta 2580 tggaaccatt acggtctgttgcatccacat cagccaaaaa tgacctggat ctaaataggt 2640 cacagactgg aaaagattgtcacttacata ggcattttgt ggatccagta ttaaatcagt 2700 tacagaggcc accccaggagactggagaga ggttaaacaa atacaaagag gaacaccgtc 2760 gaattcttca agaaagtattgatgttgctc cctttacaac taaaatcaag ggacttgagg 2820 gtgagagaga gaattattccagagtggcat catcatcttc cagtcctaaa agccatatca 2880 tcaaacaaga tatggatgtagaacgctcag tatcagatct ttataaaatg aagcactcag 2940 tgcctcagag tttaccccaaagtaactatt tcactacatt gtctaatagt gtggtcaatg 3000 aaccaccaag atcatacccatccaaagaag tttcaaatat ttacggtgat aaacagagta 3060 atgcccttgc agcggcagcagctaatcctc aaactctgac ttcatttata acatctcttt 3120 caaagcctcc acctttgattaaacaccaac cagaaagtga aggtttagta ggcaagatac 3180 cagaacatct tccacatcagattgcatctc actcagtaac aaccttcaga aatgattgta 3240 ggagtcctac ccatttgacagtttcttcta caaatacact ccgcagtatg cctgcattac 3300 atagagcacc agtatttcacccaccaatcc atcacagcct ggaaagaaag gaaggcagct 3360 atagtagtct ttcccctccaactttaactc cggtgatgcc agtaaatgct ggtggtaaag 3420 ttcaagaatc acagaagcctccaactctaa tacccgaacc aaaagactcc caggcaaatt 3480 ttaagagttc ttcagaacagagtttgacgg agatgtggag acctaataat aacctcagca 3540 aagagaaaac tgaatggcatgtggagaaaa gcagcggaaa gttacaggct gctatggcat 3600 ctgtcattgt gcgtccatcttctagtacaa aaactgatag tatgccagca atgcagttag 3660 cttctaaaga tcgagttagtgaaagatctt cagctggggc acataaaaca gattgcctca 3720 aactagcaga agccggagaaactggaagaa tcattttgcc aaatgtgaat tcagacagtg 3780 ttcacacaaa atctgaaaaaaactttcagg ctgtctcaca gggcagtgtt cccagttcag 3840 tcatgtctgc tgtaaatacgatgtgtaata ccaaaacgga tgtaatcaca tctgctgccg 3900 atactaccag tgtttccagctggggtggtt cagaagtaat ttcctcttta tcaaatacca 3960 ttttggcctc tacatcatcagaatgtgtat cttcaaaaag tgtcagtcag ccagtggctc 4020 aaaaacaaga atgcaaggtcagcaccacag caccagttac attagccagt agtaagacag 4080 gaagtgttgt tcaacccagttctgggttct caggcacaac tgattttatc catttaaaaa 4140 agcacaaggc agcattggctgcagctcagt ataaaagtag taatgccagt gagactgaac 4200 ctaatgctat aaaaaatcagacactttcag cctcccttcc tctggatagc actgtaatct 4260 gtagtacaat taacaaagcaaactctgtag gaaatgggca agcttcccag acaagtcaac 4320 caaactacca tactaaactgaaaaaggcct ggctcaccag acactcagaa gaagataaaa 4380 atactaataa aatggaaaattcagggaatt ctgtatcaga aattattaag ccatgttctg 4440 tcaacttaat agcctctacatctagtgata tacaaaatag tgtagatagt aagatcatag 4500 ttgataaata tgtaaaagatgataaagtca acaggagaaa agccaaaaga acttatgaat 4560 ctggctctga aagtggagactcagatgaaa gtgaaagcaa gtcagagcaa aggactaaaa 4620 gacaacctaa gccaacttacaaaaagaagc aaaatgattt gcaaaagaga aaaggtgaaa 4680 tagaagaaga tttgaaacccaatggagttc tcagcaggag tgccaaagaa agaagtaaac 4740 tgaagttgca aagcaacagtaatactggca ttcctcgttc agtattgaaa gattggcgta 4800 aagtcaagaa gctgaagcaaactggggaat cctttttaca ggatgactcc tgctgtgaga 4860 tagggcctaa tttacaaaagtgtcgagaat gtagacttat tcgcagtaaa aaaggagaag 4920 aaccagctca ctcaccagtattttgtagat tttactactt tagacggttg tcatttagta 4980 aaaacggagt agttagaatagatggtttct cttctcctga ccaatatgat gatgaagcta 5040 tgagtttgtg gacacatgaaaattttgaag atgatgaact agatatagag acttctaaat 5100 atatcttgga tataataggtgataagttct gtcaattagt aacatctgaa aaaacagctt 5160 tgtcctgggt gaaaaaggatgccaaaattg cctggaaaag agcagtgaga ggagtccggg 5220 agatgtgtga tgcatgtgaagcaacattgt ttaacattca ctgggtctgc caaaaatgtg 5280 gatttgtggt ctgcttagattgttacaagg caaaggaaag gaagagttct agagataaag 5340 aactatatgc ttggatgaagtgtgtgaagg gacagcctca tgatcacaaa catttaatgc 5400 caacccaaat tatacctggttctgttttga cagatcttct agatgccatg cacactctta 5460 gggaaaaata tggtattaaatcccattgtc attgtactaa caaacagaat ttacaagttg 5520 gaaattttcc tacaatgaatggtgtatctc aagttttaca gaatgttctt aatcacagta 5580 ataaaatttc tctgtgcatgcctgagtctc agcagcaaaa tactcctccg aagtctgaga 5640 aaaatggtgg cagcagcccagagagtgatg taggcacaga taacaagtta actcctccag 5700 aatcccagtc accactgcactggttagcag atcttgcaga gcaaaaagcc agagaggaaa 5760 aaaaagaaaa caaagaacttacccttgaaa accaaattaa agaagaaaga gaacaagaca 5820 actctgaatc tccaaatggcagaacatcac ctcttgtgtc ccagaataat gaacaaggct 5880 caaccttacg ggatttgctgactacaacag ctggaaagct acgtgtgggg tctacagatg 5940 ctggcattgc ctttgccccagtatattcaa tgggagcccc aagtagcaaa agtggacgga 6000 ctatgcctaa cattcttgatgacataattg cttcagttgt tgaaaacaaa attccaccaa 6060 gtaaaacctc caagataaatgtaaaaccag agcttaaaga agagcctgaa gaaagcataa 6120 tatctgcagt ggatgaaaataataaattat acagtgatat accacattct tggatctgtg 6180 agaagcatat tttatggcttaaggattata agaatagcag taattggaag cttttcaaag 6240 aatgttggaa acaaggacagcctgcagtgg tttctggtgt gcataagaaa atgaacatta 6300 gcctatggaa ggcggaatcaattagtcttg attttggaga ccaccaagct gatctcctga 6360 actgcaaaga tagcatcatttcaaatgcca atgttaagga attctgggat ggttttgaag 6420 aagtttcaaa acggcagaaaaacaagagtg gagaaacagt tgttttaaaa ttgaaagact 6480 ggccttcagg agaagacttcaagactatga tgccagcaag atacgaagat cttttaaaaa 6540 gtctgccatt gccagaatattgtaatccag aaggaaaatt caatttggcc tctcatttgc 6600 caggattttt tgtacgtcctgatctaggac ccaggttgtg cagtgcctat ggtgtagttg 6660 ctgctaaaga tcatgatataggaacaacaa atctccatat tgaagtttct gatgttgtaa 6720 atatactagt ttatgttggcatagcaaaag gaaatggcat tctctcaaaa gcaggaattc 6780 tcaagaaatt tgaggaagaagatttggatg acattttaag gaaaagattg aaggactcaa 6840 gtgaaatacc tggtgctctgtggcatattt atgctgggaa agatgttgac aagataaggg 6900 aatttcttca aaagatttcaaaagaacaag gccttgaagt tctaccagaa catgatccaa 6960 tacgtgacca aagttggtatgtgaacaaaa agctccgtca aaggctgctt gaagaatatg 7020 gagtcagaac ctgtactcttattcagttcc ttggtgatgc tattgttttg ccagcgggag 7080 cacttcatca ggttcagaattttcacagct gtattcaggt aactgaagat tttgtgtctc 7140 cagaacatct tgtagagtcatttcatttaa cacaggaact gagacttttg aaggaagaaa 7200 tcaattatga tgataaactacaggttaaaa atattttgta tcatgcagtc aaagaaatgg 7260 tgagagcctt gaagatacacgaggatgaag tagaggatat ggaagaaaat taagtgtgat 7320 ccagtttgat atttttaggttgttgaactg ggattactta accttgaatg atgatatgta 7380 tgcacactga ctttaagcttcataaaacca tcagtgccaa gaaattctct ttgtagtaat 7440 tacttgttac tgacaccacagcagtatagc atatgtcaca gctcctgtga ttcaatgtta 7500 taaaacaagc agaattttaaaagcagcact atatagctgt tttgtattat agtgtatatg 7560 atgtttgtga aaatgccagatttaaaatga tgtatttatt tttggtaaaa aataaaaaat 7620 tctatgctat attgttgatcaagtgtaaat gtgaccttgt acagtttact aaaattactg 7680 atatttttca ctacattgagacagttactg tgagaatagg acacaaacac cagctattgc 7740 ctgcatctgg gaaattgctgaatcgcacag cagtcatgtc ataatcagaa aattactgcc 7800 aaataattgt aaaatttgtaaagtataaag tatataaagt agatactaaa tacagacact 7860 tcaatatttt gttgaagctattgactgtac aattaaacat tttcaaaagg tgtaatttat 7920 ttaaaattgt ctcattttggtaaaatttat gtgaactttt aaagctaaat attaaactta 7980 atatgctatg taaatatatacatatataca tttaatgatg tattttttta aaacattggc 8040 ttgcttttgt taaagtgcaagtgttacata tggctttgta cattaaagtt gaaaggggtt 8100 ttacattttc cattaaaaggactttatc 8128 13 3781 DNA Homo sapiens 13 ctgatagtat gccagcaatgcagttagctt ctaaagatcg agttagtgaa agatcttcag 60 ctggggcaca taaaacagattgcctcaaac tagcagaagc cggagaaact ggaagaatca 120 ttttgccaaa tgtgaattcagacagtgttc acacaaaatc tgaaaaaaac tttcaggctg 180 tctcacaggg cagtgttcccagttcagtca tgtctgctgt aaatacgatg tgtaatacca 240 aaacggatgt aatcacatctgctgccgata ctaccagtgt ttccagctgg ggtggttcag 300 aagtaatttc ctctttatcaaataccattt tggcctctac atcatcagaa tgtgtatctt 360 caaaaagtgt cagtcagccagtggctcaaa aacaagaatg caaggtcagc accacagcac 420 cagttacatt agccagtagtaagacaggaa gtgttgttca acccagttct gggttctcag 480 gcacaactga ttttatccatttaaaaaagc acaaggcagc attggctgca gctcagtata 540 aaagtagtaa tgccagtgagactgaaccta atgctataaa aaatcagaca ctttcagcct 600 cccttcctct ggatagcactgtaatctgta gtacaattaa caaagcaaac tctgtaggaa 660 atgggcaagc ttcccagacaagtcaaccaa actaccatac taaactgaaa aaggcctggc 720 tcaccagaca ctcagaagaagataaaaata ctaataaaat ggaaaattca gggaattctg 780 tatcagaaat tattaagccatgttctgtca acttaatagc ctctacatct agtgatatac 840 aaaatagtgt agatagtaagatcatagttg ataaatatgt aaaagatgat aaagtcaaca 900 ggagaaaagc caaaagaacttatgaatctg gctctgaaag tggagactca gatgaaagtg 960 aaagcaagtc agagcaaaggactaaaagac aacctaagcc aacttacaaa aagaagcaaa 1020 atgatttgca aaagagaaaaggtgaaatag aagaagattt gaaacccaat ggagttctca 1080 gcaggagtgc caaagaaagaagtaaactga agttgcaaag caacagtaat accaaaattg 1140 cctggaaaag agcagtgagaggagtccggg agatgtgtga tgcatgtgaa gcaacattgt 1200 ttaacattca ctgggtctgccaaaaatgtg gatttgtggt ctgcttagat tgttacaagg 1260 caaaggaaag gaagagttctagagataaag aactatatgc ttggatgaag tgtgtgaagg 1320 gacagcctca tgatcacaaacatttaatgc caacccaaat tatacctggt tctgttttga 1380 cagatcttct agatgccatgcacactctta gggaaaaata tggtattaaa tcccattgtc 1440 attgtactaa caaacagaatttacaagttg gaaattttcc tacaatgaat ggtgtatctc 1500 aagttttaca gaatgttcttaatcacagta ataaaatttc tctgtgcatg cctgagtctc 1560 agcagcaaaa tactcctccgaagtctgaga aaaatggtgg cagcagccca gagagtgatg 1620 taggcacaga taacaagttaactcctccag aatcccagtc accactgcac tggttagcag 1680 atcttgcaga gcaaaaagccagagaggaaa aaaaagaaaa caaagaactt acccttgaaa 1740 accaaattaa agaagaaagagaacaagaca actctgaatc tccaaatggc agaacatcac 1800 ctcttgtgtc ccagaataatgaacaaggct caaccttacg ggatttgctg actacaacag 1860 ctggaaagct acgtgtggggtctacagatg ctggcattgc ctttgcccca gtatattcaa 1920 tgggagcccc aagtagcaaaagtggacgga ctatgcctaa cattcttgat gacataattg 1980 cttcagttgt tgaaaacaaaattccaccaa gtaaaacctc caagataaat gtaaaaccag 2040 agcttaaaga agagcctgaagaaagcataa tatctgcagt ggatgaaaat aataaattat 2100 acagtgatat accacattcttggatctgtg agaagcatat tttatggctt aaggattata 2160 agaatagcag taattggaagcttttcaaag aatgttggaa acaaggacag cctgcagtgg 2220 tttctggtgt gcataagaaaatgaacatta gcctatggaa ggcggaatca attagtcttg 2280 attttggaga ccaccaagctgatctcctga actgcaaaga tagcatcatt tcaaatgcca 2340 atgttaagga attctgggatggttttgaag aagtttcaaa acggcagaaa aacaagagtg 2400 gagaaacagt tgttttaaaattgaaagatt ggccttcagg agaagacttc aagactatga 2460 tgccagcaag atacgaagatctcttaaaaa gtctgccatt gccagaatat tgtaatccag 2520 aaggaaaatt caatttggcctctcatttgc caggattttt tgtacgtcct gatctaggac 2580 ccaggttgtg cagtgcctatggtgtagttg ctgctaaaga tcatgatata ggaacaacaa 2640 atctccatat tgaagtttctgatgttgtaa atatactagt ttatgttggc atagcaaaag 2700 gaaatggcat tctctcaaaagcaggaattc tcaagaaatt tgaggaagaa gatttggatg 2760 acattttaag gaaaagattgaaggactcaa gtgaaatacc tggtgctctg tggcatattt 2820 atgctgggaa agatgttgacaagataaggg aatttcttca aaaggtataa gttagtttgc 2880 aaataatatc tcttcccctcaagcaaggac ttattcattg ttgccgtacc ttcacttggc 2940 attcaacaat ataagaccattacataattt ttacttcaaa gtagatggga gtttttgttg 3000 aaagtcaaaa ccctgtttctcttgctcctc tttgaaagtc tgttaacatt ttgagggttt 3060 tgaatggtga tttgttcccagttggtagtg tgcctccatt ccctgttaat aaaagactga 3120 tctgttcttg agccagatattagttgggtg gcgaaaagta gctatctgga gtcgcgggat 3180 gtaaaaacaa atatttcatggaaacgtcta aagaaggtag gatctttatt ttctttgaat 3240 ctgatagata tttttggctgctgacctagg gaatgattct actctatcac atctgatact 3300 atgatactta ctgagtagactcagattttg agtaatcata ccttgactgt agttgtcata 3360 tactctgaaa gaaatttatatatcaacctg aaataattgg ttgaaaccct ttgctcaggt 3420 actttttaaa cttcgtaattatggaaattt gttttaagaa atgttcagtg ctgatacatc 3480 atctttctga aaggatgtaagttctgtctg tgatcaatgt gaagtaaaag agttacagcc 3540 ttttttgtac cattttatccctgaatactt acctgtattt taatctgaag tatgatcatt 3600 tgtgccttct aaagcagattatttaactga ttaaagagtc ccttgaaatt gatttttcaa 3660 gcgttagaag gttagccatgtaaaggtatt tatacttacc tcaaaaaagg atcaataaat 3720 gtttatgaaa tccatacatatgaatggttt ggcttttggt aaaaaaaaaa aaaaaaaaaa 3780 a 3781 14 2358 PRT Homosapiens 14 Met Gln Gly Pro Tyr Ser Leu Asn Gly Tyr Arg Val Arg Val TyrArg 1 5 10 15 Gln Asp Ser Ala Thr Gln Trp Phe Thr Gly Ile Ile Thr HisHis Asp 20 25 30 Leu Phe Thr Arg Thr Met Ile Val Met Asn Asp Gln Val LeuGlu Pro 35 40 45 Gln Asn Val Asp Pro Ser Met Val Gln Met Thr Phe Leu AspAsp Val 50 55 60 Val His Ser Leu Leu Lys Gly Glu Asn Ile Gly Ile Thr SerArg Arg 65 70 75 80 Arg Ser Arg Ala Asn Gln Asn Val Asn Ala Val His SerHis Tyr Thr 85 90 95 Arg Ala Gln Ala Asn Ser Pro Arg Pro Ala Met Asn SerGln Ala Ala 100 105 110 Val Pro Lys Gln Asn Thr His Gln Gln Gln Gln GlnArg Ser Ile Arg 115 120 125 Pro Asn Lys Arg Lys Gly Ser Asp Ser Ser IlePro Asp Glu Glu Lys 130 135 140 Met Lys Glu Glu Lys Tyr Asp Tyr Ile SerArg Gly Glu Asn Pro Lys 145 150 155 160 Gly Lys Asn Lys His Leu Met AsnLys Arg Arg Lys Pro Glu Glu Asp 165 170 175 Glu Lys Lys Leu Asn Met LysArg Leu Arg Thr Asp Asn Val Ser Asp 180 185 190 Phe Ser Glu Ser Ser AspSer Glu Asn Ser Asn Lys Arg Ile Ile Asp 195 200 205 Asn Ser Ser Glu GlnLys Pro Glu Asn Glu Leu Lys Asn Lys Asn Thr 210 215 220 Ser Lys Ile AsnGly Glu Glu Gly Lys Pro His Asn Asn Glu Lys Ala 225 230 235 240 Gly GluGlu Thr Leu Lys Asn Ser Gln Pro Pro Trp Asp Gln Ile Gln 245 250 255 GluAsp Lys Lys His Glu Glu Ala Glu Lys Arg Lys Ser Val Asp Thr 260 265 270Gln Leu Gln Glu Asp Met Ile Ile His Ser Ser Glu Gln Ser Thr Val 275 280285 Ser Asp His Asn Ser Asn Asp Leu Leu Pro Gln Glu Cys Asn Met Asp 290295 300 Lys Thr His Thr Met Glu Leu Leu Pro Lys Glu Lys Phe Val Ser Arg305 310 315 320 Pro Pro Thr Pro Lys Cys Val Ile Asp Ile Thr Asn Asp ThrAsn Leu 325 330 335 Glu Lys Val Ala Gln Glu Asn Ser Ser Thr Phe Gly LeuGln Thr Leu 340 345 350 Gln Lys Met Asp Pro Asn Val Ser Asp Ser Lys HisSer Ile Ala Asn 355 360 365 Ala Lys Phe Leu Glu Thr Ala Lys Lys Asp SerAsp Gln Ser Trp Val 370 375 380 Ser Asp Val Val Lys Val Asp Leu Thr GlnSer Ser Val Thr Asn Ala 385 390 395 400 Ser Ser Gly Asn Asp His Leu AsnMet Glu Lys Glu Lys Tyr Val Ser 405 410 415 Tyr Ile Ser Pro Leu Ser AlaVal Ser Val Met Glu Asp Lys Leu His 420 425 430 Lys Arg Ser Pro Pro ProGlu Thr Ile Lys Ser Lys Leu Asn Thr Ser 435 440 445 Val Asp Thr His LysIle Lys Ser Ser Pro Ser Pro Glu Val Val Lys 450 455 460 Pro Lys Ile ThrHis Ser Pro Asp Ser Val Lys Ser Lys Ala Thr Tyr 465 470 475 480 Val AsnSer Gln Ala Thr Gly Glu Arg Arg Leu Ala Asn Lys Ile Glu 485 490 495 HisGlu Leu Ser Arg Cys Ser Phe His Pro Ile Pro Thr Arg Ser Ser 500 505 510Thr Leu Glu Thr Thr Lys Ser Pro Leu Ile Ile Asp Lys Asn Glu His 515 520525 Phe Thr Val Tyr Arg Asp Pro Ala Leu Ile Gly Ser Glu Thr Gly Ala 530535 540 Asn His Ile Ser Pro Phe Leu Ser Gln His Pro Phe Pro Leu His Ser545 550 555 560 Ser Ser His Arg Thr Cys Leu Asn Pro Gly Thr His His ProAla Leu 565 570 575 Thr Pro Ala Pro His Leu Leu Ala Gly Ser Ser Ser GlnThr Pro Leu 580 585 590 Pro Thr Ile Asn Thr His Pro Leu Thr Ser Gly ProHis His Ala Val 595 600 605 His His Pro His Leu Leu Pro Thr Val Leu ProGly Val Pro Thr Ala 610 615 620 Ser Leu Leu Gly Gly His Pro Arg Leu GluSer Ala His Ala Ser Ser 625 630 635 640 Leu Ser His Leu Ala Leu Ala HisGln Gln Gln Gln Gln Leu Leu Gln 645 650 655 His Gln Ser Pro His Leu LeuGly Gln Ala His Pro Ser Ala Ser Tyr 660 665 670 Asn Gln Leu Gly Leu TyrPro Ile Ile Trp Gln Tyr Pro Asn Gly Thr 675 680 685 His Ala Tyr Ser GlyLeu Gly Leu Pro Ser Ser Lys Trp Val His Pro 690 695 700 Glu Asn Ala ValAsn Ala Glu Ala Ser Leu Arg Arg Asn Ser Pro Ser 705 710 715 720 Pro TrpLeu His Gln Pro Thr Pro Val Thr Ser Ala Asp Gly Ile Gly 725 730 735 LeuLeu Ser His Ile Pro Val Arg Pro Ser Ser Ala Glu Pro His Arg 740 745 750Pro Leu Lys Ile Thr Ala His Ser Ser Pro Pro Leu Thr Lys Thr Leu 755 760765 Val Asp His His Lys Glu Glu Leu Glu Arg Lys Ala Phe Met Glu Pro 770775 780 Leu Arg Ser Val Ala Ser Thr Ser Ala Lys Asn Asp Leu Asp Leu Asn785 790 795 800 Arg Ser Gln Thr Gly Lys Asp Cys His Leu His Arg His PheVal Asp 805 810 815 Pro Val Leu Asn Gln Leu Gln Arg Pro Pro Gln Glu ThrGly Glu Arg 820 825 830 Leu Asn Lys Tyr Lys Glu Glu His Arg Arg Ile LeuGln Glu Ser Ile 835 840 845 Asp Val Ala Pro Phe Thr Thr Lys Ile Lys GlyLeu Glu Gly Glu Arg 850 855 860 Glu Asn Tyr Ser Arg Val Ala Ser Ser SerSer Ser Pro Lys Ser His 865 870 875 880 Ile Ile Lys Gln Asp Met Asp ValGlu Arg Ser Val Ser Asp Leu Tyr 885 890 895 Lys Met Lys His Ser Val ProGln Ser Leu Pro Gln Ser Asn Tyr Phe 900 905 910 Thr Thr Leu Ser Asn SerVal Val Asn Glu Pro Pro Arg Ser Tyr Pro 915 920 925 Ser Lys Glu Val SerAsn Ile Tyr Gly Asp Lys Gln Ser Asn Ala Leu 930 935 940 Ala Ala Ala AlaAla Asn Pro Gln Thr Leu Thr Ser Phe Ile Thr Ser 945 950 955 960 Leu SerLys Pro Pro Pro Leu Ile Lys His Gln Pro Glu Ser Glu Gly 965 970 975 LeuVal Gly Lys Ile Pro Glu His Leu Pro His Gln Ile Ala Ser His 980 985 990Ser Val Thr Thr Phe Arg Asn Asp Cys Arg Ser Pro Thr His Leu Thr 995 10001005 Val Ser Ser Thr Asn Thr Leu Arg Ser Met Pro Ala Leu His Arg Ala1010 1015 1020 Pro Val Phe His Pro Pro Ile His His Ser Leu Glu Arg LysGlu Gly 1025 1030 1035 1040 Ser Tyr Ser Ser Leu Ser Pro Pro Thr Leu ThrPro Val Met Pro Val 1045 1050 1055 Asn Ala Gly Gly Lys Val Gln Glu SerGln Lys Pro Pro Thr Leu Ile 1060 1065 1070 Pro Glu Pro Lys Asp Ser GlnAla Asn Phe Lys Ser Ser Ser Glu Gln 1075 1080 1085 Ser Leu Thr Glu MetTrp Arg Pro Asn Asn Asn Leu Ser Lys Glu Lys 1090 1095 1100 Thr Glu TrpHis Val Glu Lys Ser Ser Gly Lys Leu Gln Ala Ala Met 1105 1110 1115 1120Ala Ser Val Ile Val Arg Pro Ser Ser Ser Thr Lys Thr Asp Ser Met 11251130 1135 Pro Ala Met Gln Leu Ala Ser Lys Asp Arg Val Ser Glu Arg SerSer 1140 1145 1150 Ala Gly Ala His Lys Thr Asp Cys Leu Lys Leu Ala GluAla Gly Glu 1155 1160 1165 Thr Gly Arg Ile Ile Leu Pro Asn Val Asn SerAsp Ser Val His Thr 1170 1175 1180 Lys Ser Glu Lys Asn Phe Gln Ala ValSer Gln Gly Ser Val Pro Ser 1185 1190 1195 1200 Ser Val Met Ser Ala ValAsn Thr Met Cys Asn Thr Lys Thr Asp Val 1205 1210 1215 Ile Thr Ser AlaAla Asp Thr Thr Ser Val Ser Ser Trp Gly Gly Ser 1220 1225 1230 Glu ValIle Ser Ser Leu Ser Asn Thr Ile Leu Ala Ser Thr Ser Ser 1235 1240 1245Glu Cys Val Ser Ser Lys Ser Val Ser Gln Pro Val Ala Gln Lys Gln 12501255 1260 Glu Cys Lys Val Ser Thr Thr Ala Pro Val Thr Leu Ala Ser SerLys 1265 1270 1275 1280 Thr Gly Ser Val Val Gln Pro Ser Ser Gly Phe SerGly Thr Thr Asp 1285 1290 1295 Phe Ile His Leu Lys Lys His Lys Ala AlaLeu Ala Ala Ala Gln Tyr 1300 1305 1310 Lys Ser Ser Asn Ala Ser Glu ThrGlu Pro Asn Ala Ile Lys Asn Gln 1315 1320 1325 Thr Leu Ser Ala Ser LeuPro Leu Asp Ser Thr Val Ile Cys Ser Thr 1330 1335 1340 Ile Asn Lys AlaAsn Ser Val Gly Asn Gly Gln Ala Ser Gln Thr Ser 1345 1350 1355 1360 GlnPro Asn Tyr His Thr Lys Leu Lys Lys Ala Trp Leu Thr Arg His 1365 13701375 Ser Glu Glu Asp Lys Asn Thr Asn Lys Met Glu Asn Ser Gly Asn Ser1380 1385 1390 Val Ser Glu Ile Ile Lys Pro Cys Ser Val Asn Leu Ile AlaSer Thr 1395 1400 1405 Ser Ser Asp Ile Gln Asn Ser Val Asp Ser Lys IleIle Val Asp Lys 1410 1415 1420 Tyr Val Lys Asp Asp Lys Val Asn Arg ArgLys Ala Lys Arg Thr Tyr 1425 1430 1435 1440 Glu Ser Gly Ser Glu Ser GlyAsp Ser Asp Glu Ser Glu Ser Lys Ser 1445 1450 1455 Glu Gln Arg Thr LysArg Gln Pro Lys Pro Thr Tyr Lys Lys Lys Gln 1460 1465 1470 Asn Asp LeuGln Lys Arg Lys Gly Glu Ile Glu Glu Asp Leu Lys Pro 1475 1480 1485 AsnGly Val Leu Ser Arg Ser Ala Lys Glu Arg Ser Lys Leu Lys Leu 1490 14951500 Gln Ser Asn Ser Asn Thr Gly Ile Pro Arg Ser Val Leu Lys Asp Trp1505 1510 1515 1520 Arg Lys Val Lys Lys Leu Lys Gln Thr Gly Glu Ser PheLeu Gln Asp 1525 1530 1535 Asp Ser Cys Cys Glu Ile Gly Pro Asn Leu GlnLys Cys Arg Glu Cys 1540 1545 1550 Arg Leu Ile Arg Ser Lys Lys Gly GluGlu Pro Ala His Ser Pro Val 1555 1560 1565 Phe Cys Arg Phe Tyr Tyr PheArg Arg Leu Ser Phe Ser Lys Asn Gly 1570 1575 1580 Val Val Arg Ile AspGly Phe Ser Ser Pro Asp Gln Tyr Asp Asp Glu 1585 1590 1595 1600 Ala MetSer Leu Trp Thr His Glu Asn Phe Glu Asp Asp Glu Leu Asp 1605 1610 1615Ile Glu Thr Ser Lys Tyr Ile Leu Asp Ile Ile Gly Asp Lys Phe Cys 16201625 1630 Gln Leu Val Thr Ser Glu Lys Thr Ala Leu Ser Trp Val Lys LysAsp 1635 1640 1645 Ala Lys Ile Ala Trp Lys Arg Ala Val Arg Gly Val ArgGlu Met Cys 1650 1655 1660 Asp Ala Cys Glu Ala Thr Leu Phe Asn Ile HisTrp Val Cys Gln Lys 1665 1670 1675 1680 Cys Gly Phe Val Val Cys Leu AspCys Tyr Lys Ala Lys Glu Arg Lys 1685 1690 1695 Ser Ser Arg Asp Lys GluLeu Tyr Ala Trp Met Lys Cys Val Lys Gly 1700 1705 1710 Gln Pro His AspHis Lys His Leu Met Pro Thr Gln Ile Ile Pro Gly 1715 1720 1725 Ser ValLeu Thr Asp Leu Leu Asp Ala Met His Thr Leu Arg Glu Lys 1730 1735 1740Tyr Gly Ile Lys Ser His Cys His Cys Thr Asn Lys Gln Asn Leu Gln 17451750 1755 1760 Val Gly Asn Phe Pro Thr Met Asn Gly Val Ser Gln Val LeuGln Asn 1765 1770 1775 Val Leu Asn His Ser Asn Lys Ile Ser Leu Cys MetPro Glu Ser Gln 1780 1785 1790 Gln Gln Asn Thr Pro Pro Lys Ser Glu LysAsn Gly Gly Ser Ser Pro 1795 1800 1805 Glu Ser Asp Val Gly Thr Asp AsnLys Leu Thr Pro Pro Glu Ser Gln 1810 1815 1820 Ser Pro Leu His Trp LeuAla Asp Leu Ala Glu Gln Lys Ala Arg Glu 1825 1830 1835 1840 Glu Lys LysGlu Asn Lys Glu Leu Thr Leu Glu Asn Gln Ile Lys Glu 1845 1850 1855 GluArg Glu Gln Asp Asn Ser Glu Ser Pro Asn Gly Arg Thr Ser Pro 1860 18651870 Leu Val Ser Gln Asn Asn Glu Gln Gly Ser Thr Leu Arg Asp Leu Leu1875 1880 1885 Thr Thr Thr Ala Gly Lys Leu Arg Val Gly Ser Thr Asp AlaGly Ile 1890 1895 1900 Ala Phe Ala Pro Val Tyr Ser Met Gly Ala Pro SerSer Lys Ser Gly 1905 1910 1915 1920 Arg Thr Met Pro Asn Ile Leu Asp AspIle Ile Ala Ser Val Val Glu 1925 1930 1935 Asn Lys Ile Pro Pro Ser LysThr Ser Lys Ile Asn Val Lys Pro Glu 1940 1945 1950 Leu Lys Glu Glu ProGlu Glu Ser Ile Ile Ser Ala Val Asp Glu Asn 1955 1960 1965 Asn Lys LeuTyr Ser Asp Ile Pro His Ser Trp Ile Cys Glu Lys His 1970 1975 1980 IleLeu Trp Leu Lys Asp Tyr Lys Asn Ser Ser Asn Trp Lys Leu Phe 1985 19901995 2000 Lys Glu Cys Trp Lys Gln Gly Gln Pro Ala Val Val Ser Gly ValHis 2005 2010 2015 Lys Lys Met Asn Ile Ser Leu Trp Lys Ala Glu Ser IleSer Leu Asp 2020 2025 2030 Phe Gly Asp His Gln Ala Asp Leu Leu Asn CysLys Asp Ser Ile Ile 2035 2040 2045 Ser Asn Ala Asn Val Lys Glu Phe TrpAsp Gly Phe Glu Glu Val Ser 2050 2055 2060 Lys Arg Gln Lys Asn Lys SerGly Glu Thr Val Val Leu Lys Leu Lys 2065 2070 2075 2080 Asp Trp Pro SerGly Glu Asp Phe Lys Thr Met Met Pro Ala Arg Tyr 2085 2090 2095 Glu AspLeu Leu Lys Ser Leu Pro Leu Pro Glu Tyr Cys Asn Pro Glu 2100 2105 2110Gly Lys Phe Asn Leu Ala Ser His Leu Pro Gly Phe Phe Val Arg Pro 21152120 2125 Asp Leu Gly Pro Arg Leu Cys Ser Ala Tyr Gly Val Val Ala AlaLys 2130 2135 2140 Asp His Asp Ile Gly Thr Thr Asn Leu His Ile Glu ValSer Asp Val 2145 2150 2155 2160 Val Asn Ile Leu Val Tyr Val Gly Ile AlaLys Gly Asn Gly Ile Leu 2165 2170 2175 Ser Lys Ala Gly Ile Leu Lys LysPhe Glu Glu Glu Asp Leu Asp Asp 2180 2185 2190 Ile Leu Arg Lys Arg LeuLys Asp Ser Ser Glu Ile Pro Gly Ala Leu 2195 2200 2205 Trp His Ile TyrAla Gly Lys Asp Val Asp Lys Ile Arg Glu Phe Leu 2210 2215 2220 Gln LysIle Ser Lys Glu Gln Gly Leu Glu Val Leu Pro Glu His Asp 2225 2230 22352240 Pro Ile Arg Asp Gln Ser Trp Tyr Val Asn Lys Lys Leu Arg Gln Arg2245 2250 2255 Leu Leu Glu Glu Tyr Gly Val Arg Thr Cys Thr Leu Ile GlnPhe Leu 2260 2265 2270 Gly Asp Ala Ile Val Leu Pro Ala Gly Ala Leu HisGln Val Gln Asn 2275 2280 2285 Phe His Ser Cys Ile Gln Val Thr Glu AspPhe Val Ser Pro Glu His 2290 2295 2300 Leu Val Glu Ser Phe His Leu ThrGln Glu Leu Arg Leu Leu Lys Glu 2305 2310 2315 2320 Glu Ile Asn Tyr AspAsp Lys Leu Gln Val Lys Asn Ile Leu Tyr His 2325 2330 2335 Ala Val LysGlu Met Val Arg Ala Leu Lys Ile His Glu Asp Glu Val 2340 2345 2350 GluAsp Met Glu Glu Asn 2355 16 2540 PRT Homo sapiens 16 Met Ala Val Glu ThrArg Ala Glu Leu Val Gly Lys Arg Phe Leu Cys 1 5 10 15 Val Ala Val GlyAsp Glu Ala Arg Ser Glu Arg Trp Glu Ser Gly Arg 20 25 30 Gly Trp Arg SerTrp Arg Ala Gly Val Ile Arg Ala Val Ser His Arg 35 40 45 Asp Ser Arg AsnPro Asp Leu Ala Val Tyr Val Glu Phe Asp Asp Leu 50 55 60 Glu Trp Asp LysArg Glu Trp Val Lys Val Tyr Glu Asp Phe Ser Thr 65 70 75 80 Phe Leu ValGlu Tyr His Leu Ile Trp Ala Lys Arg Asn Asp Pro Ser 85 90 95 Gln Thr GlnGly Ser Lys Ser Lys Gln Ile Gln Trp Pro Ala Leu Thr 100 105 110 Phe LysPro Leu Val Glu Arg Asn Ile Pro Ser Ser Val Thr Ala Val 115 120 125 GluPhe Leu Val Asp Lys Gln Leu Asp Phe Leu Thr Glu Asp Ser Ala 130 135 140Phe Gln Pro Tyr Gln Asp Asp Ile Asp Ser Leu Asn Pro Val Leu Arg 145 150155 160 Asp Asn Pro Gln Leu His Glu Glu Val Lys Val Trp Val Lys Glu Gln165 170 175 Lys Val Gln Glu Ile Phe Met Gln Gly Pro Tyr Ser Leu Asn GlyTyr 180 185 190 Arg Val Arg Val Tyr Arg Gln Asp Ser Ala Thr Gln Trp PheThr Gly 195 200 205 Ile Ile Thr His His Asp Leu Phe Thr Arg Thr Met IleVal Met Asn 210 215 220 Asp Gln Val Leu Glu Pro Gln Asn Val Asp Pro SerMet Val Gln Met 225 230 235 240 Thr Phe Leu Asp Asp Val Val His Ser LeuLeu Lys Gly Glu Asn Ile 245 250 255 Gly Ile Thr Ser Arg Arg Arg Ser ArgAla Asn Gln Asn Val Asn Ala 260 265 270 Val His Ser His Tyr Thr Arg AlaGln Ala Asn Ser Pro Arg Pro Ala 275 280 285 Met Asn Ser Gln Ala Ala ValPro Lys Gln Asn Thr His Gln Gln Gln 290 295 300 Gln Gln Arg Ser Ile ArgPro Asn Lys Arg Lys Gly Ser Asp Ser Ser 305 310 315 320 Ile Pro Asp GluGlu Lys Met Lys Glu Glu Lys Tyr Asp Tyr Ile Ser 325 330 335 Arg Gly GluAsn Pro Lys Gly Lys Asn Lys His Leu Met Asn Lys Arg 340 345 350 Arg LysPro Glu Glu Asp Glu Lys Lys Leu Asn Met Lys Arg Leu Arg 355 360 365 ThrAsp Asn Val Ser Asp Phe Ser Glu Ser Ser Asp Ser Glu Asn Ser 370 375 380Asn Lys Arg Ile Ile Asp Asn Ser Ser Glu Gln Lys Pro Glu Asn Glu 385 390395 400 Leu Lys Asn Lys Asn Thr Ser Lys Ile Asn Gly Glu Glu Gly Lys Pro405 410 415 His Asn Asn Glu Lys Ala Gly Glu Glu Thr Leu Lys Asn Ser GlnPro 420 425 430 Pro Trp Asp Gln Ile Gln Glu Asp Lys Lys His Glu Glu AlaGlu Lys 435 440 445 Arg Lys Ser Val Asp Thr Gln Leu Gln Glu Asp Met IleIle His Ser 450 455 460 Ser Glu Gln Ser Thr Val Ser Asp His Asn Ser AsnAsp Leu Leu Pro 465 470 475 480 Gln Glu Cys Asn Met Asp Lys Thr His ThrMet Glu Leu Leu Pro Lys 485 490 495 Glu Lys Phe Val Ser Arg Pro Pro ThrPro Lys Cys Val Ile Asp Ile 500 505 510 Thr Asn Asp Thr Asn Leu Glu LysVal Ala Gln Glu Asn Ser Ser Thr 515 520 525 Phe Gly Leu Gln Thr Leu GlnLys Met Asp Pro Asn Val Ser Asp Ser 530 535 540 Lys His Ser Ile Ala AsnAla Lys Phe Leu Glu Thr Ala Lys Lys Asp 545 550 555 560 Ser Asp Gln SerTrp Val Ser Asp Val Val Lys Val Asp Leu Thr Gln 565 570 575 Ser Ser ValThr Asn Ala Ser Ser Gly Asn Asp His Leu Asn Met Glu 580 585 590 Lys GluLys Tyr Val Ser Tyr Ile Ser Pro Leu Ser Ala Val Ser Val 595 600 605 MetGlu Asp Lys Leu His Lys Arg Ser Pro Pro Pro Glu Thr Ile Lys 610 615 620Ser Lys Leu Asn Thr Ser Val Asp Thr His Lys Ile Lys Ser Ser Pro 625 630635 640 Ser Pro Glu Val Val Lys Pro Lys Ile Thr His Ser Pro Asp Ser Val645 650 655 Lys Ser Lys Ala Thr Tyr Val Asn Ser Gln Ala Thr Gly Glu ArgArg 660 665 670 Leu Ala Asn Lys Ile Glu His Glu Leu Ser Arg Cys Ser PheHis Pro 675 680 685 Ile Pro Thr Arg Ser Ser Thr Leu Glu Thr Thr Lys SerPro Leu Ile 690 695 700 Ile Asp Lys Asn Glu His Phe Thr Val Tyr Arg AspPro Ala Leu Ile 705 710 715 720 Gly Ser Glu Thr Gly Ala Asn His Ile SerPro Phe Leu Ser Gln His 725 730 735 Pro Phe Pro Leu His Ser Ser Ser HisArg Thr Cys Leu Asn Pro Gly 740 745 750 Thr His His Pro Ala Leu Thr ProAla Pro His Leu Leu Ala Gly Ser 755 760 765 Ser Ser Gln Thr Pro Leu ProThr Ile Asn Thr His Pro Leu Thr Ser 770 775 780 Gly Pro His His Ala ValHis His Pro His Leu Leu Pro Thr Val Leu 785 790 795 800 Pro Gly Val ProThr Ala Ser Leu Leu Gly Gly His Pro Arg Leu Glu 805 810 815 Ser Ala HisAla Ser Ser Leu Ser His Leu Ala Leu Ala His Gln Gln 820 825 830 Gln GlnGln Leu Leu Gln His Gln Ser Pro His Leu Leu Gly Gln Ala 835 840 845 HisPro Ser Ala Ser Tyr Asn Gln Leu Gly Leu Tyr Pro Ile Ile Trp 850 855 860Gln Tyr Pro Asn Gly Thr His Ala Tyr Ser Gly Leu Gly Leu Pro Ser 865 870875 880 Ser Lys Trp Val His Pro Glu Asn Ala Val Asn Ala Glu Ala Ser Leu885 890 895 Arg Arg Asn Ser Pro Ser Pro Trp Leu His Gln Pro Thr Pro ValThr 900 905 910 Ser Ala Asp Gly Ile Gly Leu Leu Ser His Ile Pro Val ArgPro Ser 915 920 925 Ser Ala Glu Pro His Arg Pro Leu Lys Ile Thr Ala HisSer Ser Pro 930 935 940 Pro Leu Thr Lys Thr Leu Val Asp His His Lys GluGlu Leu Glu Arg 945 950 955 960 Lys Ala Phe Met Glu Pro Leu Arg Ser ValAla Ser Thr Ser Ala Lys 965 970 975 Asn Asp Leu Asp Leu Asn Arg Ser GlnThr Gly Lys Asp Cys His Leu 980 985 990 His Arg His Phe Val Asp Pro ValLeu Asn Gln Leu Gln Arg Pro Pro 995 1000 1005 Gln Glu Thr Gly Glu ArgLeu Asn Lys Tyr Lys Glu Glu His Arg Arg 1010 1015 1020 Ile Leu Gln GluSer Ile Asp Val Ala Pro Phe Thr Thr Lys Ile Lys 1025 1030 1035 1040 GlyLeu Glu Gly Glu Arg Glu Asn Tyr Ser Arg Val Ala Ser Ser Ser 1045 10501055 Ser Ser Pro Lys Ser His Ile Ile Lys Gln Asp Met Asp Val Glu Arg1060 1065 1070 Ser Val Ser Asp Leu Tyr Lys Met Lys His Ser Val Pro GlnSer Leu 1075 1080 1085 Pro Gln Ser Asn Tyr Phe Thr Thr Leu Ser Asn SerVal Val Asn Glu 1090 1095 1100 Pro Pro Arg Ser Tyr Pro Ser Lys Glu ValSer Asn Ile Tyr Gly Asp 1105 1110 1115 1120 Lys Gln Ser Asn Ala Leu AlaAla Ala Ala Ala Asn Pro Gln Thr Leu 1125 1130 1135 Thr Ser Phe Ile ThrSer Leu Ser Lys Pro Pro Pro Leu Ile Lys His 1140 1145 1150 Gln Pro GluSer Glu Gly Leu Val Gly Lys Ile Pro Glu His Leu Pro 1155 1160 1165 HisGln Ile Ala Ser His Ser Val Thr Thr Phe Arg Asn Asp Cys Arg 1170 11751180 Ser Pro Thr His Leu Thr Val Ser Ser Thr Asn Thr Leu Arg Ser Met1185 1190 1195 1200 Pro Ala Leu His Arg Ala Pro Val Phe His Pro Pro IleHis His Ser 1205 1210 1215 Leu Glu Arg Lys Glu Gly Ser Tyr Ser Ser LeuSer Pro Pro Thr Leu 1220 1225 1230 Thr Pro Val Met Pro Val Asn Ala GlyGly Lys Val Gln Glu Ser Gln 1235 1240 1245 Lys Pro Pro Thr Leu Ile ProGlu Pro Lys Asp Ser Gln Ala Asn Phe 1250 1255 1260 Lys Ser Ser Ser GluGln Ser Leu Thr Glu Met Trp Arg Pro Asn Asn 1265 1270 1275 1280 Asn LeuSer Lys Glu Lys Thr Glu Trp His Val Glu Lys Ser Ser Gly 1285 1290 1295Lys Leu Gln Ala Ala Met Ala Ser Val Ile Val Arg Pro Ser Ser Ser 13001305 1310 Thr Lys Thr Asp Ser Met Pro Ala Met Gln Leu Ala Ser Lys AspArg 1315 1320 1325 Val Ser Glu Arg Ser Ser Ala Gly Ala His Lys Thr AspCys Leu Lys 1330 1335 1340 Leu Ala Glu Ala Gly Glu Thr Gly Arg Ile IleLeu Pro Asn Val Asn 1345 1350 1355 1360 Ser Asp Ser Val His Thr Lys SerGlu Lys Asn Phe Gln Ala Val Ser 1365 1370 1375 Gln Gly Ser Val Pro SerSer Val Met Ser Ala Val Asn Thr Met Cys 1380 1385 1390 Asn Thr Lys ThrAsp Val Ile Thr Ser Ala Ala Asp Thr Thr Ser Val 1395 1400 1405 Ser SerTrp Gly Gly Ser Glu Val Ile Ser Ser Leu Ser Asn Thr Ile 1410 1415 1420Leu Ala Ser Thr Ser Ser Glu Cys Val Ser Ser Lys Ser Val Ser Gln 14251430 1435 1440 Pro Val Ala Gln Lys Gln Glu Cys Lys Val Ser Thr Thr AlaPro Val 1445 1450 1455 Thr Leu Ala Ser Ser Lys Thr Gly Ser Val Val GlnPro Ser Ser Gly 1460 1465 1470 Phe Ser Gly Thr Thr Asp Phe Ile His LeuLys Lys His Lys Ala Ala 1475 1480 1485 Leu Ala Ala Ala Gln Tyr Lys SerSer Asn Ala Ser Glu Thr Glu Pro 1490 1495 1500 Asn Ala Ile Lys Asn GlnThr Leu Ser Ala Ser Leu Pro Leu Asp Ser 1505 1510 1515 1520 Thr Val IleCys Ser Thr Ile Asn Lys Ala Asn Ser Val Gly Asn Gly 1525 1530 1535 GlnAla Ser Gln Thr Ser Gln Pro Asn Tyr His Thr Lys Leu Lys Lys 1540 15451550 Ala Trp Leu Thr Arg His Ser Glu Glu Asp Lys Asn Thr Asn Lys Met1555 1560 1565 Glu Asn Ser Gly Asn Ser Val Ser Glu Ile Ile Lys Pro CysSer Val 1570 1575 1580 Asn Leu Ile Ala Ser Thr Ser Ser Asp Ile Gln AsnSer Val Asp Ser 1585 1590 1595 1600 Lys Ile Ile Val Asp Lys Tyr Val LysAsp Asp Lys Val Asn Arg Arg 1605 1610 1615 Lys Ala Lys Arg Thr Tyr GluSer Gly Ser Glu Ser Gly Asp Ser Asp 1620 1625 1630 Glu Ser Glu Ser LysSer Glu Gln Arg Thr Lys Arg Gln Pro Lys Pro 1635 1640 1645 Thr Tyr LysLys Lys Gln Asn Asp Leu Gln Lys Arg Lys Gly Glu Ile 1650 1655 1660 GluGlu Asp Leu Lys Pro Asn Gly Val Leu Ser Arg Ser Ala Lys Glu 1665 16701675 1680 Arg Ser Lys Leu Lys Leu Gln Ser Asn Ser Asn Thr Gly Ile ProArg 1685 1690 1695 Ser Val Leu Lys Asp Trp Arg Lys Val Lys Lys Leu LysGln Thr Gly 1700 1705 1710 Glu Ser Phe Leu Gln Asp Asp Ser Cys Cys GluIle Gly Pro Asn Leu 1715 1720 1725 Gln Lys Cys Arg Glu Cys Arg Leu IleArg Ser Lys Lys Gly Glu Glu 1730 1735 1740 Pro Ala His Ser Pro Val PheCys Arg Phe Tyr Tyr Phe Arg Arg Leu 1745 1750 1755 1760 Ser Phe Ser LysAsn Gly Val Val Arg Ile Asp Gly Phe Ser Ser Pro 1765 1770 1775 Asp GlnTyr Asp Asp Glu Ala Met Ser Leu Trp Thr His Glu Asn Phe 1780 1785 1790Glu Asp Asp Glu Leu Asp Ile Glu Thr Ser Lys Tyr Ile Leu Asp Ile 17951800 1805 Ile Gly Asp Lys Phe Cys Gln Leu Val Thr Ser Glu Lys Thr AlaLeu 1810 1815 1820 Ser Trp Val Lys Lys Asp Ala Lys Ile Ala Trp Lys ArgAla Val Arg 1825 1830 1835 1840 Gly Val Arg Glu Met Cys Asp Ala Cys GluAla Thr Leu Phe Asn Ile 1845 1850 1855 His Trp Val Cys Gln Lys Cys GlyPhe Val Val Cys Leu Asp Cys Tyr 1860 1865 1870 Lys Ala Lys Glu Arg LysSer Ser Arg Asp Lys Glu Leu Tyr Ala Trp 1875 1880 1885 Met Lys Cys ValLys Gly Gln Pro His Asp His Lys His Leu Met Pro 1890 1895 1900 Thr GlnIle Ile Pro Gly Ser Val Leu Thr Asp Leu Leu Asp Ala Met 1905 1910 19151920 His Thr Leu Arg Glu Lys Tyr Gly Ile Lys Ser His Cys His Cys Thr1925 1930 1935 Asn Lys Gln Asn Leu Gln Val Gly Asn Phe Pro Thr Met AsnGly Val 1940 1945 1950 Ser Gln Val Leu Gln Asn Val Leu Asn His Ser AsnLys Ile Ser Leu 1955 1960 1965 Cys Met Pro Glu Ser Gln Gln Gln Asn ThrPro Pro Lys Ser Glu Lys 1970 1975 1980 Asn Gly Gly Ser Ser Pro Glu SerAsp Val Gly Thr Asp Asn Lys Leu 1985 1990 1995 2000 Thr Pro Pro Glu SerGln Ser Pro Leu His Trp Leu Ala Asp Leu Ala 2005 2010 2015 Glu Gln LysAla Arg Glu Glu Lys Lys Glu Asn Lys Glu Leu Thr Leu 2020 2025 2030 GluAsn Gln Ile Lys Glu Glu Arg Glu Gln Asp Asn Ser Glu Ser Pro 2035 20402045 Asn Gly Arg Thr Ser Pro Leu Val Ser Gln Asn Asn Glu Gln Gly Ser2050 2055 2060 Thr Leu Arg Asp Leu Leu Thr Thr Thr Ala Gly Lys Leu ArgVal Gly 2065 2070 2075 2080 Ser Thr Asp Ala Gly Ile Ala Phe Ala Pro ValTyr Ser Met Gly Ala 2085 2090 2095 Pro Ser Ser Lys Ser Gly Arg Thr MetPro Asn Ile Leu Asp Asp Ile 2100 2105 2110 Ile Ala Ser Val Val Glu AsnLys Ile Pro Pro Ser Lys Thr Ser Lys 2115 2120 2125 Ile Asn Val Lys ProGlu Leu Lys Glu Glu Pro Glu Glu Ser Ile Ile 2130 2135 2140 Ser Ala ValAsp Glu Asn Asn Lys Leu Tyr Ser Asp Ile Pro His Ser 2145 2150 2155 2160Trp Ile Cys Glu Lys His Ile Leu Trp Leu Lys Asp Tyr Lys Asn Ser 21652170 2175 Ser Asn Trp Lys Leu Phe Lys Glu Cys Trp Lys Gln Gly Gln ProAla 2180 2185 2190 Val Val Ser Gly Val His Lys Lys Met Asn Ile Ser LeuTrp Lys Ala 2195 2200 2205 Glu Ser Ile Ser Leu Asp Phe Gly Asp His GlnAla Asp Leu Leu Asn 2210 2215 2220 Cys Lys Asp Ser Ile Ile Ser Asn AlaAsn Val Lys Glu Phe Trp Asp 2225 2230 2235 2240 Gly Phe Glu Glu Val SerLys Arg Gln Lys Asn Lys Ser Gly Glu Thr 2245 2250 2255 Val Val Leu LysLeu Lys Asp Trp Pro Ser Gly Glu Asp Phe Lys Thr 2260 2265 2270 Met MetPro Ala Arg Tyr Glu Asp Leu Leu Lys Ser Leu Pro Leu Pro 2275 2280 2285Glu Tyr Cys Asn Pro Glu Gly Lys Phe Asn Leu Ala Ser His Leu Pro 22902295 2300 Gly Phe Phe Val Arg Pro Asp Leu Gly Pro Arg Leu Cys Ser AlaTyr 2305 2310 2315 2320 Gly Val Val Ala Ala Lys Asp His Asp Ile Gly ThrThr Asn Leu His 2325 2330 2335 Ile Glu Val Ser Asp Val Val Asn Ile LeuVal Tyr Val Gly Ile Ala 2340 2345 2350 Lys Gly Asn Gly Ile Leu Ser LysAla Gly Ile Leu Lys Lys Phe Glu 2355 2360 2365 Glu Glu Asp Leu Asp AspIle Leu Arg Lys Arg Leu Lys Asp Ser Ser 2370 2375 2380 Glu Ile Pro GlyAla Leu Trp His Ile Tyr Ala Gly Lys Asp Val Asp 2385 2390 2395 2400 LysIle Arg Glu Phe Leu Gln Lys Ile Ser Lys Glu Gln Gly Leu Glu 2405 24102415 Val Leu Pro Glu His Asp Pro Ile Arg Asp Gln Ser Trp Tyr Val Asn2420 2425 2430 Lys Lys Leu Arg Gln Arg Leu Leu Glu Glu Tyr Gly Val ArgThr Cys 2435 2440 2445 Thr Leu Ile Gln Phe Leu Gly Asp Ala Ile Val LeuPro Ala Gly Ala 2450 2455 2460 Leu His Gln Val Gln Asn Phe His Ser CysIle Gln Val Thr Glu Asp 2465 2470 2475 2480 Phe Val Ser Pro Glu His LeuVal Glu Ser Phe His Leu Thr Gln Glu 2485 2490 2495 Leu Arg Leu Leu LysGlu Glu Ile Asn Tyr Asp Asp Lys Leu Gln Val 2500 2505 2510 Lys Asn IleLeu Tyr His Ala Val Lys Glu Met Val Arg Ala Leu Lys 2515 2520 2525 IleHis Glu Asp Glu Val Glu Asp Met Glu Glu Asn 2530 2535 2540 17 2321 PRTHomo sapiens 17 Met Ile Val Met Asn Asp Gln Val Leu Glu Pro Gln Asn ValAsp Pro 1 5 10 15 Ser Met Val Gln Met Thr Phe Leu Asp Asp Val Val HisSer Leu Leu 20 25 30 Lys Gly Glu Asn Ile Gly Ile Thr Ser Arg Arg Arg SerArg Ala Asn 35 40 45 Gln Asn Val Asn Ala Val His Ser His Tyr Thr Arg AlaGln Ala Asn 50 55 60 Ser Pro Arg Pro Ala Met Asn Ser Gln Ala Ala Val ProLys Gln Asn 65 70 75 80 Thr His Gln Gln Gln Gln Gln Arg Ser Ile Arg ProAsn Lys Arg Lys 85 90 95 Gly Ser Asp Ser Ser Ile Pro Asp Glu Glu Lys MetLys Glu Glu Lys 100 105 110 Tyr Asp Tyr Ile Ser Arg Gly Glu Asn Pro LysGly Lys Asn Lys His 115 120 125 Leu Met Asn Lys Arg Arg Lys Pro Glu GluAsp Glu Lys Lys Leu Asn 130 135 140 Met Lys Arg Leu Arg Thr Asp Asn ValSer Asp Phe Ser Glu Ser Ser 145 150 155 160 Asp Ser Glu Asn Ser Asn LysArg Ile Ile Asp Asn Ser Ser Glu Gln 165 170 175 Lys Pro Glu Asn Glu LeuLys Asn Lys Asn Thr Ser Lys Ile Asn Gly 180 185 190 Glu Glu Gly Lys ProHis Asn Asn Glu Lys Ala Gly Glu Glu Thr Leu 195 200 205 Lys Asn Ser GlnPro Pro Trp Asp Gln Ile Gln Glu Asp Lys Lys His 210 215 220 Glu Glu AlaGlu Lys Arg Lys Ser Val Asp Thr Gln Leu Gln Glu Asp 225 230 235 240 MetIle Ile His Ser Ser Glu Gln Ser Thr Val Ser Asp His Asn Ser 245 250 255Asn Asp Leu Leu Pro Gln Glu Cys Asn Met Asp Lys Thr His Thr Met 260 265270 Glu Leu Leu Pro Lys Glu Lys Phe Val Ser Arg Pro Pro Thr Pro Lys 275280 285 Cys Val Ile Asp Ile Thr Asn Asp Thr Asn Leu Glu Lys Val Ala Gln290 295 300 Glu Asn Ser Ser Thr Phe Gly Leu Gln Thr Leu Gln Lys Met AspPro 305 310 315 320 Asn Val Ser Asp Ser Lys His Ser Ile Ala Asn Ala LysPhe Leu Glu 325 330 335 Thr Ala Lys Lys Asp Ser Asp Gln Ser Trp Val SerAsp Val Val Lys 340 345 350 Val Asp Leu Thr Gln Ser Ser Val Thr Asn AlaSer Ser Gly Asn Asp 355 360 365 His Leu Asn Met Glu Lys Glu Lys Tyr ValSer Tyr Ile Ser Pro Leu 370 375 380 Ser Ala Val Ser Val Met Glu Asp LysLeu His Lys Arg Ser Pro Pro 385 390 395 400 Pro Glu Thr Ile Lys Ser LysLeu Asn Thr Ser Val Asp Thr His Lys 405 410 415 Ile Lys Ser Ser Pro SerPro Glu Val Val Lys Pro Lys Ile Thr His 420 425 430 Ser Pro Asp Ser ValLys Ser Lys Ala Thr Tyr Val Asn Ser Gln Ala 435 440 445 Thr Gly Glu ArgArg Leu Ala Asn Lys Ile Glu His Glu Leu Ser Arg 450 455 460 Cys Ser PheHis Pro Ile Pro Thr Arg Ser Ser Thr Leu Glu Thr Thr 465 470 475 480 LysSer Pro Leu Ile Ile Asp Lys Asn Glu His Phe Thr Val Tyr Arg 485 490 495Asp Pro Ala Leu Ile Gly Ser Glu Thr Gly Ala Asn His Ile Ser Pro 500 505510 Phe Leu Ser Gln His Pro Phe Pro Leu His Ser Ser Ser His Arg Thr 515520 525 Cys Leu Asn Pro Gly Thr His His Pro Ala Leu Thr Pro Ala Pro His530 535 540 Leu Leu Ala Gly Ser Ser Ser Gln Thr Pro Leu Pro Thr Ile AsnThr 545 550 555 560 His Pro Leu Thr Ser Gly Pro His His Ala Val His HisPro His Leu 565 570 575 Leu Pro Thr Val Leu Pro Gly Val Pro Thr Ala SerLeu Leu Gly Gly 580 585 590 His Pro Arg Leu Glu Ser Ala His Ala Ser SerLeu Ser His Leu Ala 595 600 605 Leu Ala His Gln Gln Gln Gln Gln Leu LeuGln His Gln Ser Pro His 610 615 620 Leu Leu Gly Gln Ala His Pro Ser AlaSer Tyr Asn Gln Leu Gly Leu 625 630 635 640 Tyr Pro Ile Ile Trp Gln TyrPro Asn Gly Thr His Ala Tyr Ser Gly 645 650 655 Leu Gly Leu Pro Ser SerLys Trp Val His Pro Glu Asn Ala Val Asn 660 665 670 Ala Glu Ala Ser LeuArg Arg Asn Ser Pro Ser Pro Trp Leu His Gln 675 680 685 Pro Thr Pro ValThr Ser Ala Asp Gly Ile Gly Leu Leu Ser His Ile 690 695 700 Pro Val ArgPro Ser Ser Ala Glu Pro His Arg Pro Leu Lys Ile Thr 705 710 715 720 AlaHis Ser Ser Pro Pro Leu Thr Lys Thr Leu Val Asp His His Lys 725 730 735Glu Glu Leu Glu Arg Lys Ala Phe Met Glu Pro Leu Arg Ser Val Ala 740 745750 Ser Thr Ser Ala Lys Asn Asp Leu Asp Leu Asn Arg Ser Gln Thr Gly 755760 765 Lys Asp Cys His Leu His Arg His Phe Val Asp Pro Val Leu Asn Gln770 775 780 Leu Gln Arg Pro Pro Gln Glu Thr Gly Glu Arg Leu Asn Lys TyrLys 785 790 795 800 Glu Glu His Arg Arg Ile Leu Gln Glu Ser Ile Asp ValAla Pro Phe 805 810 815 Thr Thr Lys Ile Lys Gly Leu Glu Gly Glu Arg GluAsn Tyr Ser Arg 820 825 830 Val Ala Ser Ser Ser Ser Ser Pro Lys Ser HisIle Ile Lys Gln Asp 835 840 845 Met Asp Val Glu Arg Ser Val Ser Asp LeuTyr Lys Met Lys His Ser 850 855 860 Val Pro Gln Ser Leu Pro Gln Ser AsnTyr Phe Thr Thr Leu Ser Asn 865 870 875 880 Ser Val Val Asn Glu Pro ProArg Ser Tyr Pro Ser Lys Glu Val Ser 885 890 895 Asn Ile Tyr Gly Asp LysGln Ser Asn Ala Leu Ala Ala Ala Ala Ala 900 905 910 Asn Pro Gln Thr LeuThr Ser Phe Ile Thr Ser Leu Ser Lys Pro Pro 915 920 925 Pro Leu Ile LysHis Gln Pro Glu Ser Glu Gly Leu Val Gly Lys Ile 930 935 940 Pro Glu HisLeu Pro His Gln Ile Ala Ser His Ser Val Thr Thr Phe 945 950 955 960 ArgAsn Asp Cys Arg Ser Pro Thr His Leu Thr Val Ser Ser Thr Asn 965 970 975Thr Leu Arg Ser Met Pro Ala Leu His Arg Ala Pro Val Phe His Pro 980 985990 Pro Ile His His Ser Leu Glu Arg Lys Glu Gly Ser Tyr Ser Ser Leu 9951000 1005 Ser Pro Pro Thr Leu Thr Pro Val Met Pro Val Asn Ala Gly GlyLys 1010 1015 1020 Val Gln Glu Ser Gln Lys Pro Pro Thr Leu Ile Pro GluPro Lys Asp 1025 1030 1035 1040 Ser Gln Ala Asn Phe Lys Ser Ser Ser GluGln Ser Leu Thr Glu Met 1045 1050 1055 Trp Arg Pro Asn Asn Asn Leu SerLys Glu Lys Thr Glu Trp His Val 1060 1065 1070 Glu Lys Ser Ser Gly LysLeu Gln Ala Ala Met Ala Ser Val Ile Val 1075 1080 1085 Arg Pro Ser SerSer Thr Lys Thr Asp Ser Met Pro Ala Met Gln Leu 1090 1095 1100 Ala SerLys Asp Arg Val Ser Glu Arg Ser Ser Ala Gly Ala His Lys 1105 1110 11151120 Thr Asp Cys Leu Lys Leu Ala Glu Ala Gly Glu Thr Gly Arg Ile Ile1125 1130 1135 Leu Pro Asn Val Asn Ser Asp Ser Val His Thr Lys Ser GluLys Asn 1140 1145 1150 Phe Gln Ala Val Ser Gln Gly Ser Val Pro Ser SerVal Met Ser Ala 1155 1160 1165 Val Asn Thr Met Cys Asn Thr Lys Thr AspVal Ile Thr Ser Ala Ala 1170 1175 1180 Asp Thr Thr Ser Val Ser Ser TrpGly Gly Ser Glu Val Ile Ser Ser 1185 1190 1195 1200 Leu Ser Asn Thr IleLeu Ala Ser Thr Ser Ser Glu Cys Val Ser Ser 1205 1210 1215 Lys Ser ValSer Gln Pro Val Ala Gln Lys Gln Glu Cys Lys Val Ser 1220 1225 1230 ThrThr Ala Pro Val Thr Leu Ala Ser Ser Lys Thr Gly Ser Val Val 1235 12401245 Gln Pro Ser Ser Gly Phe Ser Gly Thr Thr Asp Phe Ile His Leu Lys1250 1255 1260 Lys His Lys Ala Ala Leu Ala Ala Ala Gln Tyr Lys Ser SerAsn Ala 1265 1270 1275 1280 Ser Glu Thr Glu Pro Asn Ala Ile Lys Asn GlnThr Leu Ser Ala Ser 1285 1290 1295 Leu Pro Leu Asp Ser Thr Val Ile CysSer Thr Ile Asn Lys Ala Asn 1300 1305 1310 Ser Val Gly Asn Gly Gln AlaSer Gln Thr Ser Gln Pro Asn Tyr His 1315 1320 1325 Thr Lys Leu Lys LysAla Trp Leu Thr Arg His Ser Glu Glu Asp Lys 1330 1335 1340 Asn Thr AsnLys Met Glu Asn Ser Gly Asn Ser Val Ser Glu Ile Ile 1345 1350 1355 1360Lys Pro Cys Ser Val Asn Leu Ile Ala Ser Thr Ser Ser Asp Ile Gln 13651370 1375 Asn Ser Val Asp Ser Lys Ile Ile Val Asp Lys Tyr Val Lys AspAsp 1380 1385 1390 Lys Val Asn Arg Arg Lys Ala Lys Arg Thr Tyr Glu SerGly Ser Glu 1395 1400 1405 Ser Gly Asp Ser Asp Glu Ser Glu Ser Lys SerGlu Gln Arg Thr Lys 1410 1415 1420 Arg Gln Pro Lys Pro Thr Tyr Lys LysLys Gln Asn Asp Leu Gln Lys 1425 1430 1435 1440 Arg Lys Gly Glu Ile GluGlu Asp Leu Lys Pro Asn Gly Val Leu Ser 1445 1450 1455 Arg Ser Ala LysGlu Arg Ser Lys Leu Lys Leu Gln Ser Asn Ser Asn 1460 1465 1470 Thr GlyIle Pro Arg Ser Val Leu Lys Asp Trp Arg Lys Val Lys Lys 1475 1480 1485Leu Lys Gln Thr Gly Glu Ser Phe Leu Gln Asp Asp Ser Cys Cys Glu 14901495 1500 Ile Gly Pro Asn Leu Gln Lys Cys Arg Glu Cys Arg Leu Ile ArgSer 1505 1510 1515 1520 Lys Lys Gly Glu Glu Pro Ala His Ser Pro Val PheCys Arg Phe Tyr 1525 1530 1535 Tyr Phe Arg Arg Leu Ser Phe Ser Lys AsnGly Val Val Arg Ile Asp 1540 1545 1550 Gly Phe Ser Ser Pro Asp Gln TyrAsp Asp Glu Ala Met Ser Leu Trp 1555 1560 1565 Thr His Glu Asn Phe GluAsp Asp Glu Leu Asp Ile Glu Thr Ser Lys 1570 1575 1580 Tyr Ile Leu AspIle Ile Gly Asp Lys Phe Cys Gln Leu Val Thr Ser 1585 1590 1595 1600 GluLys Thr Ala Leu Ser Trp Val Lys Lys Asp Ala Lys Ile Ala Trp 1605 16101615 Lys Arg Ala Val Arg Gly Val Arg Glu Met Cys Asp Ala Cys Glu Ala1620 1625 1630 Thr Leu Phe Asn Ile His Trp Val Cys Gln Lys Cys Gly PheVal Val 1635 1640 1645 Cys Leu Asp Cys Tyr Lys Ala Lys Glu Arg Lys SerSer Arg Asp Lys 1650 1655 1660 Glu Leu Tyr Ala Trp Met Lys Cys Val LysGly Gln Pro His Asp His 1665 1670 1675 1680 Lys His Leu Met Pro Thr GlnIle Ile Pro Gly Ser Val Leu Thr Asp 1685 1690 1695 Leu Leu Asp Ala MetHis Thr Leu Arg Glu Lys Tyr Gly Ile Lys Ser 1700 1705 1710 His Cys HisCys Thr Asn Lys Gln Asn Leu Gln Val Gly Asn Phe Pro 1715 1720 1725 ThrMet Asn Gly Val Ser Gln Val Leu Gln Asn Val Leu Asn His Ser 1730 17351740 Asn Lys Ile Ser Leu Cys Met Pro Glu Ser Gln Gln Gln Asn Thr Pro1745 1750 1755 1760 Pro Lys Ser Glu Lys Asn Gly Gly Ser Ser Pro Glu SerAsp Val Gly 1765 1770 1775 Thr Asp Asn Lys Leu Thr Pro Pro Glu Ser GlnSer Pro Leu His Trp 1780 1785 1790 Leu Ala Asp Leu Ala Glu Gln Lys AlaArg Glu Glu Lys Lys Glu Asn 1795 1800 1805 Lys Glu Leu Thr Leu Glu AsnGln Ile Lys Glu Glu Arg Glu Gln Asp 1810 1815 1820 Asn Ser Glu Ser ProAsn Gly Arg Thr Ser Pro Leu Val Ser Gln Asn 1825 1830 1835 1840 Asn GluGln Gly Ser Thr Leu Arg Asp Leu Leu Thr Thr Thr Ala Gly 1845 1850 1855Lys Leu Arg Val Gly Ser Thr Asp Ala Gly Ile Ala Phe Ala Pro Val 18601865 1870 Tyr Ser Met Gly Ala Pro Ser Ser Lys Ser Gly Arg Thr Met ProAsn 1875 1880 1885 Ile Leu Asp Asp Ile Ile Ala Ser Val Val Glu Asn LysIle Pro Pro 1890 1895 1900 Ser Lys Thr Ser Lys Ile Asn Val Lys Pro GluLeu Lys Glu Glu Pro 1905 1910 1915 1920 Glu Glu Ser Ile Ile Ser Ala ValAsp Glu Asn Asn Lys Leu Tyr Ser 1925 1930 1935 Asp Ile Pro His Ser TrpIle Cys Glu Lys His Ile Leu Trp Leu Lys 1940 1945 1950 Asp Tyr Lys AsnSer Ser Asn Trp Lys Leu Phe Lys Glu Cys Trp Lys 1955 1960 1965 Gln GlyGln Pro Ala Val Val Ser Gly Val His Lys Lys Met Asn Ile 1970 1975 1980Ser Leu Trp Lys Ala Glu Ser Ile Ser Leu Asp Phe Gly Asp His Gln 19851990 1995 2000 Ala Asp Leu Leu Asn Cys Lys Asp Ser Ile Ile Ser Asn AlaAsn Val 2005 2010 2015 Lys Glu Phe Trp Asp Gly Phe Glu Glu Val Ser LysArg Gln Lys Asn 2020 2025 2030 Lys Ser Gly Glu Thr Val Val Leu Lys LeuLys Asp Trp Pro Ser Gly 2035 2040 2045 Glu Asp Phe Lys Thr Met Met ProAla Arg Tyr Glu Asp Leu Leu Lys 2050 2055 2060 Ser Leu Pro Leu Pro GluTyr Cys Asn Pro Glu Gly Lys Phe Asn Leu 2065 2070 2075 2080 Ala Ser HisLeu Pro Gly Phe Phe Val Arg Pro Asp Leu Gly Pro Arg 2085 2090 2095 LeuCys Ser Ala Tyr Gly Val Val Ala Ala Lys Asp His Asp Ile Gly 2100 21052110 Thr Thr Asn Leu His Ile Glu Val Ser Asp Val Val Asn Ile Leu Val2115 2120 2125 Tyr Val Gly Ile Ala Lys Gly Asn Gly Ile Leu Ser Lys AlaGly Ile 2130 2135 2140 Leu Lys Lys Phe Glu Glu Glu Asp Leu Asp Asp IleLeu Arg Lys Arg 2145 2150 2155 2160 Leu Lys Asp Ser Ser Glu Ile Pro GlyAla Leu Trp His Ile Tyr Ala 2165 2170 2175 Gly Lys Asp Val Asp Lys IleArg Glu Phe Leu Gln Lys Ile Ser Lys 2180 2185 2190 Glu Gln Gly Leu GluVal Leu Pro Glu His Asp Pro Ile Arg Asp Gln 2195 2200 2205 Ser Trp TyrVal Asn Lys Lys Leu Arg Gln Arg Leu Leu Glu Glu Tyr 2210 2215 2220 GlyVal Arg Thr Cys Thr Leu Ile Gln Phe Leu Gly Asp Ala Ile Val 2225 22302235 2240 Leu Pro Ala Gly Ala Leu His Gln Val Gln Asn Phe His Ser CysIle 2245 2250 2255 Gln Val Thr Glu Asp Phe Val Ser Pro Glu His Leu ValGlu Ser Phe 2260 2265 2270 His Leu Thr Gln Glu Leu Arg Leu Leu Lys GluGlu Ile Asn Tyr Asp 2275 2280 2285 Asp Lys Leu Gln Val Lys Asn Ile LeuTyr His Ala Val Lys Glu Met 2290 2295 2300 Val Arg Ala Leu Lys Ile HisGlu Asp Glu Val Glu Asp Met Glu Glu 2305 2310 2315 2320 Asn 18 953 PRTHomo sapiens 18 Met Pro Ala Met Gln Leu Ala Ser Lys Asp Arg Val Ser GluArg Ser 1 5 10 15 Ser Ala Gly Ala His Lys Thr Asp Cys Leu Lys Leu AlaGlu Ala Gly 20 25 30 Glu Thr Gly Arg Ile Ile Leu Pro Asn Val Asn Ser AspSer Val His 35 40 45 Thr Lys Ser Glu Lys Asn Phe Gln Ala Val Ser Gln GlySer Val Pro 50 55 60 Ser Ser Val Met Ser Ala Val Asn Thr Met Cys Asn ThrLys Thr Asp 65 70 75 80 Val Ile Thr Ser Ala Ala Asp Thr Thr Ser Val SerSer Trp Gly Gly 85 90 95 Ser Glu Val Ile Ser Ser Leu Ser Asn Thr Ile LeuAla Ser Thr Ser 100 105 110 Ser Glu Cys Val Ser Ser Lys Ser Val Ser GlnPro Val Ala Gln Lys 115 120 125 Gln Glu Cys Lys Val Ser Thr Thr Ala ProVal Thr Leu Ala Ser Ser 130 135 140 Lys Thr Gly Ser Val Val Gln Pro SerSer Gly Phe Ser Gly Thr Thr 145 150 155 160 Asp Phe Ile His Leu Lys LysHis Lys Ala Ala Leu Ala Ala Ala Gln 165 170 175 Tyr Lys Ser Ser Asn AlaSer Glu Thr Glu Pro Asn Ala Ile Lys Asn 180 185 190 Gln Thr Leu Ser AlaSer Leu Pro Leu Asp Ser Thr Val Ile Cys Ser 195 200 205 Thr Ile Asn LysAla Asn Ser Val Gly Asn Gly Gln Ala Ser Gln Thr 210 215 220 Ser Gln ProAsn Tyr His Thr Lys Leu Lys Lys Ala Trp Leu Thr Arg 225 230 235 240 HisSer Glu Glu Asp Lys Asn Thr Asn Lys Met Glu Asn Ser Gly Asn 245 250 255Ser Val Ser Glu Ile Ile Lys Pro Cys Ser Val Asn Leu Ile Ala Ser 260 265270 Thr Ser Ser Asp Ile Gln Asn Ser Val Asp Ser Lys Ile Ile Val Asp 275280 285 Lys Tyr Val Lys Asp Asp Lys Val Asn Arg Arg Lys Ala Lys Arg Thr290 295 300 Tyr Glu Ser Gly Ser Glu Ser Gly Asp Ser Asp Glu Ser Glu SerLys 305 310 315 320 Ser Glu Gln Arg Thr Lys Arg Gln Pro Lys Pro Thr TyrLys Lys Lys 325 330 335 Gln Asn Asp Leu Gln Lys Arg Lys Gly Glu Ile GluGlu Asp Leu Lys 340 345 350 Pro Asn Gly Val Leu Ser Arg Ser Ala Lys GluArg Ser Lys Leu Lys 355 360 365 Leu Gln Ser Asn Ser Asn Thr Lys Ile AlaTrp Lys Arg Ala Val Arg 370 375 380 Gly Val Arg Glu Met Cys Asp Ala CysGlu Ala Thr Leu Phe Asn Ile 385 390 395 400 His Trp Val Cys Gln Lys CysGly Phe Val Val Cys Leu Asp Cys Tyr 405 410 415 Lys Ala Lys Glu Arg LysSer Ser Arg Asp Lys Glu Leu Tyr Ala Trp 420 425 430 Met Lys Cys Val LysGly Gln Pro His Asp His Lys His Leu Met Pro 435 440 445 Thr Gln Ile IlePro Gly Ser Val Leu Thr Asp Leu Leu Asp Ala Met 450 455 460 His Thr LeuArg Glu Lys Tyr Gly Ile Lys Ser His Cys His Cys Thr 465 470 475 480 AsnLys Gln Asn Leu Gln Val Gly Asn Phe Pro Thr Met Asn Gly Val 485 490 495Ser Gln Val Leu Gln Asn Val Leu Asn His Ser Asn Lys Ile Ser Leu 500 505510 Cys Met Pro Glu Ser Gln Gln Gln Asn Thr Pro Pro Lys Ser Glu Lys 515520 525 Asn Gly Gly Ser Ser Pro Glu Ser Asp Val Gly Thr Asp Asn Lys Leu530 535 540 Thr Pro Pro Glu Ser Gln Ser Pro Leu His Trp Leu Ala Asp LeuAla 545 550 555 560 Glu Gln Lys Ala Arg Glu Glu Lys Lys Glu Asn Lys GluLeu Thr Leu 565 570 575 Glu Asn Gln Ile Lys Glu Glu Arg Glu Gln Asp AsnSer Glu Ser Pro 580 585 590 Asn Gly Arg Thr Ser Pro Leu Val Ser Gln AsnAsn Glu Gln Gly Ser 595 600 605 Thr Leu Arg Asp Leu Leu Thr Thr Thr AlaGly Lys Leu Arg Val Gly 610 615 620 Ser Thr Asp Ala Gly Ile Ala Phe AlaPro Val Tyr Ser Met Gly Ala 625 630 635 640 Pro Ser Ser Lys Ser Gly ArgThr Met Pro Asn Ile Leu Asp Asp Ile 645 650 655 Ile Ala Ser Val Val GluAsn Lys Ile Pro Pro Ser Lys Thr Ser Lys 660 665 670 Ile Asn Val Lys ProGlu Leu Lys Glu Glu Pro Glu Glu Ser Ile Ile 675 680 685 Ser Ala Val AspGlu Asn Asn Lys Leu Tyr Ser Asp Ile Pro His Ser 690 695 700 Trp Ile CysGlu Lys His Ile Leu Trp Leu Lys Asp Tyr Lys Asn Ser 705 710 715 720 SerAsn Trp Lys Leu Phe Lys Glu Cys Trp Lys Gln Gly Gln Pro Ala 725 730 735Val Val Ser Gly Val His Lys Lys Met Asn Ile Ser Leu Trp Lys Ala 740 745750 Glu Ser Ile Ser Leu Asp Phe Gly Asp His Gln Ala Asp Leu Leu Asn 755760 765 Cys Lys Asp Ser Ile Ile Ser Asn Ala Asn Val Lys Glu Phe Trp Asp770 775 780 Gly Phe Glu Glu Val Ser Lys Arg Gln Lys Asn Lys Ser Gly GluThr 785 790 795 800 Val Val Leu Lys Leu Lys Asp Trp Pro Ser Gly Glu AspPhe Lys Thr 805 810 815 Met Met Pro Ala Arg Tyr Glu Asp Leu Leu Lys SerLeu Pro Leu Pro 820 825 830 Glu Tyr Cys Asn Pro Glu Gly Lys Phe Asn LeuAla Ser His Leu Pro 835 840 845 Gly Phe Phe Val Arg Pro Asp Leu Gly ProArg Leu Cys Ser Ala Tyr 850 855 860 Gly Val Val Ala Ala Lys Asp His AspIle Gly Thr Thr Asn Leu His 865 870 875 880 Ile Glu Val Ser Asp Val ValAsn Ile Leu Val Tyr Val Gly Ile Ala 885 890 895 Lys Gly Asn Gly Ile LeuSer Lys Ala Gly Ile Leu Lys Lys Phe Glu 900 905 910 Glu Glu Asp Leu AspAsp Ile Leu Arg Lys Arg Leu Lys Asp Ser Ser 915 920 925 Glu Ile Pro GlyAla Leu Trp His Ile Tyr Ala Gly Lys Asp Val Asp 930 935 940 Lys Ile ArgGlu Phe Leu Gln Lys Val 945 950 19 6995 DNA Homo sapiens 19 gaatgcaatatggataaaac acataccatg gaattgctac caaaggagaa gtttgtatcc 60 agaccacccacaccaaaatg tgttattgat attacaaatg acactaattt agaaaaggtg 120 gctcaggaaaactcaagtac ctttggcctt cagacacttc agaaaatgga tcctaatgtt 180 agtgattcaaaacactctat tgcaaatgca aaattcttgg aaacagcaaa aaaagattct 240 gaccagagctgggtcagtga tgtagttaaa gtggatctaa cccaatcaag tgttacaaat 300 gcttcttcaggaaatgatca cttgaacatg gaaaaagaga agtatgtctc ttacatttct 360 cctttaagtgcagtttctgt catggaagat aagctgcata agcgaagtcc acctccagag 420 actataaaatctaaacttaa tacttcagta gatactcaca agataaaatc cagcccatca 480 cctgaagttgttaaacccaa aataactcat tctcctgatt ctgtaaagtc taaggccact 540 tatgtgaacagccaagctac tggtgaaaga agattggcaa ataagataga acatgagcta 600 tcaagatgcagttttcatcc aattcctact cgaagcagta cattagaaac tacaaagagt 660 cctcttatcattgataaaaa tgagcatttt acagtttaca gagatcctgc acttattggg 720 tcagaaacaggagctaatca tatttcacct ttcctaagcc agcatccttt tcctcttcac 780 tcctcatctcatagaacctg tttaaatcca ggtacccatc atcctgcctt aactcctgca 840 ccccatttactagccggatc atctagtcaa actccattac ctaccattaa cactcatcct 900 ctgactagtggtccacacca tgctgttcat caccctcatt tacttcccac tgtgttacct 960 ggagtgcctactgcctcctt acttggtggc cacccacgac tagagagtgc tcatgccagc 1020 agcttgagccacttagcgct agcacaccag caacaacaac agttgttaca gcaccagtca 1080 cctcatcttcttggacaagc ccatccttct gcttcatata atcagcttgg actttatcca 1140 attatttggcagtatccaaa tggaacacat gcatactcag gacttggttt gccttcttct 1200 aagtgggttcacccagaaaa tgcagttaat gctgaagctt cattaaggag gaattctccc 1260 agtccttggctacatcagcc cacccctgtg acctcagcag atggtattgg attacttagt 1320 cacattcctgtcagaccttc cagtgcagag cctcatcggc ctcttaaaat tacagcccat 1380 tccagtccaccattgacaaa aactttagta gatcatcata aggaagaatt agaaagaaaa 1440 gcttttatggaaccattacg gtctgttgca tccacatcag ccaaaaatga cctggatcta 1500 aataggtcacagactggaaa agattgtcac ttacataggc attttgtgga tccagtatta 1560 aatcagttacagaggccacc ccaggagact ggagagaggt taaacaaata caaagaggaa 1620 caccgtcgaattcttcaaga aagtattgat gttgctccct ttacaactaa aatcaaggga 1680 cttgagggtgagagagagaa ttattccaga gtggcatcat catcttccag tcctaaaagc 1740 catatcatcaaacaagatat ggatgtagaa cgctcagtat cagatcttta taaaatgaag 1800 cactcagtgcctcagagttt accccaaagt aactatttca ctacattgtc taatagtgtg 1860 gtcaatgaaccaccaagatc atacccatcc aaagaagttt caaatattta cggtgataaa 1920 cagagtaatgcccttgcagc ggcagcagct aatcctcaaa ctctgacttc atttataaca 1980 tctctttcaaagcctccacc tttgattaaa caccaaccag aaagtgaagg tttagtaggc 2040 aagataccagaacatcttcc acatcagatt gcatctcact cagtaacaac cttcagaaat 2100 gattgtaggagtcctaccca tttgacagtt tcttctacaa atacactccg cagtatgcct 2160 gcattacatagagcaccagt atttcaccca ccaatccatc acagcctgga aagaaaggaa 2220 ggcagctatagtagtctttc ccctccaact ttaactccgg tgatgccagt aaatgctggt 2280 ggtaaagttcaagaatcaca gaagcctcca actctaatac ccgaaccaaa agactcccag 2340 gcaaattttaagagttcttc agaacagagt ttgacggaga tgtggagacc taataataac 2400 ctcagcaaagagaaaactga atggcatgtg gagaaaagca gcggaaagtt acaggctgct 2460 atggcatctgtcattgtgcg tccatcttct agtacaaaaa ctgatagtat gccagcaatg 2520 cagttagcttctaaagatcg wgttagtgaa agatcttcag ctggggcaca taaaacagat 2580 tgcctcaaactagcagaagc cggagaaact ggaagaatca ttttgccaaa tgtgaattca 2640 gacagtgttcacacaaaatc tgaaaaaaac tttcaggctg tctcacaggg cagtgttccc 2700 agttcagtcatgtctgctgt aaatacgatg tgtaatacca aaacggatgt aatcacatct 2760 gctgccgatactaccagtgt ttccagctgg ggtggttcag aagtaatttc ctctttatca 2820 aataccattttggcctctac atcatcagaa tgtgtatctt caaaaagtgt cagtcagcca 2880 gtggctcaaaaacaagaatg caaggtcagc accacagcac cagttacatt agccagtagt 2940 aagacaggaagtgttgttca acccagttct gggttctcag gcacaactga ttttatccat 3000 ttaaaaaagcacaaggcagc attggctgca gctcagtata aaagtagtaa tgccagtgag 3060 actgaacctaatgctataaa aaatcagaca ctttcagcct cccttcctct ggatagcact 3120 gtaatctgtagtacaattaa caaagcaaac tctgtaggaa atgggcaagc ttcccagaca 3180 agtcaaccaaactaccatac taaactgaaa aaggcctggc tcaccagaca ctcagaagaa 3240 gataaaaatactaataaaat ggaaaattca gggaattctg tatcagaaat tattaagcca 3300 tgttctgtcaacttaatagc ctctacatct agtgatatac aaaatagtgt agatagtaag 3360 atcatagttgataaatatgt aaaagatgat aaagtcaaca ggagaaaagc caaaagaact 3420 tatgaatctggctctgaaag tggagactca gatgaaagtg aaagcaagtc agagcaaagg 3480 actaaaagacaacctaagcc aacttacaaa aagaagcaaa atgatttgca aaagagaaaa 3540 ggtgaaatagaagaagattt gaaacccaat ggagttctca gcaggagtgc caaagaaaga 3600 agtaaactgaagttgcaaag caacagtaat actggcattc ctcgttcagt attgaaagat 3660 tggcgtaaagtcaagaagct gaagcaaact ggggaatcct ttttacagga tgactcctgc 3720 tgtgagatagggcctaattt acaaaagtgt cgagaatgta gacttattcg cagtaaaaaa 3780 ggagaagaaccagctcactc accagtattt tgtagatttt actactttag acggttgtca 3840 tttagtaaaaacggagtagt tagaatagat ggtttctctt ctcctgacca atatgatgat 3900 gaagctatgagtttgtggac acatgaaaat tttgaagatg atgaactaga tatagagact 3960 tctaaatatatcttggatat aataggtgat aagttctgtc aattagtaac atctgaaaaa 4020 acagctttgtcctgggtgaa aaaggatgcc aaaattgcct ggaaaagagc agtgagagga 4080 gtccgggagatgtgtgatgc atgtgaagca acattgttta acattcactg ggtctgccaa 4140 aaatgtggatttgtggtctg cttagattgt tacaaggcaa aggaaaggaa gagttctaga 4200 gataaagaactatatgcttg gatgaagtgt gtgaagggac agcctcatga tcacaaacat 4260 ttaatgccaacccaaattat acctggttct gttttgacag atcttctaga tgccatgcac 4320 actcttagggaaaaatatgg tattaaatcc cattgtcatt gtactaacaa acagaattta 4380 caagttggaaattttcctac aatgaatggt gtatctcaag ttttacagaa tgttcttaat 4440 cacagtaataaaatttctct gtgcatgcct gagtctcagc agcaaaatac tcctccgaag 4500 tctgagaaaaatggtggcag cagcccagag agtgatgtag gcacagataa caagttaact 4560 cctccagaatcccagtcacc actgcactgg ttagcagatc ttgcagagca aaaagccaga 4620 gaggaaaaaaaagaaaacaa agaacttacc cttgaaaacc aaattaaaga agaaagagaa 4680 caagacaactctgaatctcc aaatggcaga acatcacctc ttgtgtccca gaataatgaa 4740 caaggctcaaccttacggga tttgctgact acaacagctg gaaagctacg tgtggggtct 4800 acagatgctggcattgcctt tgccccagta tattcaatgg gagccccaag tagcaaaagt 4860 ggacggactatgcctaacat tcttgatgac ataattgctt cagttgttga aaacaaaatt 4920 ccaccaagtaaaacctccaa gataaatgta aaaccagagc ttaaagaaga gcctgaagaa 4980 agcataatatctgcagtgga tgaaaataat aaattataca gtgatatacc acattcttgg 5040 atctgtgagaagcatatttt atggcttaag gattataaga atagcagtaa ttggaagctt 5100 ttcaaagaatgttggaaaca aggacagcct gcagtggttt ctggtgtgca taagaaaatg 5160 aacattagcctatggaaggc ggaatcaatt agtcttgatt ttggagacca ccaagctgat 5220 ctcctgaactgcaaagatag catcatttca aatgccaatg ttaaggaatt ctgggatggt 5280 tttgaagaagtttcaaaacg gcagaaaaac aagagtggag aaacagttgt tttaaaattg 5340 aaagactggccttcaggaga agacttcaag actatgatgc cagcaagata cgaagatctt 5400 ttaaaaagtctgccattgcc agaatattgt aatccagaag gaaaattcaa tttggcctct 5460 catttgccaggattttttgt acgtcctgat ctaggaccca ggttgtgcag tgcctatggt 5520 gtagttgctgctaaagatca tgatatagga acaacaaatc tccatattga agtttctgat 5580 gttgtaaatatactagttta tgttggcata gcaaaaggaa atggcattct ctcaaaagca 5640 ggaattctcaagaaatttga ggaagaagat ttggatgaca ttttaaggaa aagattgaag 5700 gactcaagtgaaatacctgg tgctctgtgg catatttatg ctgggaaaga tgttgacaag 5760 ataagggaatttcttcaaaa gatttcaaaa gaacaaggcc ttgaagttct accagaacat 5820 gatccaatacgtgaccaaag ttggtatgtg aacaaaaagc tccgtcaaag gctgcttgaa 5880 gaatatggagtcagaacctg tactcttatt cagttccttg gtgatgctat tgttttgcca 5940 gcgggagcacttcatcaggt tcagaatttt cacagctgta ttcaggtaac tgaagatttt 6000 gtgtctccagaacatcttgt agagtcattt catttaacac aggaactgag acttttgaag 6060 gaagaaatcaattatgatga taaactacag gttaaaaata ttttgtatca tgcagtcaaa 6120 gaaatggtgagagccttgaa gatacacgag gatgaagtag aggatatgga agaaaattaa 6180 gtgtgatccagtttgatatt tttaggttgt tgaactggga ttacttaacc ttgaatgatg 6240 atatgtatgcacactgactt taagcttcat aaaaccatca gtgccaagaa attctctttg 6300 tagtaattacttgttactga caccacagca gtatagcata tgtcacagct cctgtgattc 6360 aatgttataaaacaagcaga attttaaaag cagcactata tagctgtttt gtattatagt 6420 gtatatgatgtttgtgaaaa tgccagattt aaaatgatgt atttattttt ggtaaaaaat 6480 aaaaaattctatgctatatt gttgatcaag tgtaaatgtg accttgtaca gtttactaaa 6540 attactgatatttttcacta cattgagaca gttactgtga gaataggaca caaacaccag 6600 ctattgcctgcatctgggaa attgctgaat cgcacagcag tcatgtcata atcagaaaat 6660 tactgccaaataattgtaaa atttgtaaag tataaagtat ataaagtaga tactaaatac 6720 agacacttcaatattttgtt gaagctattg actgtacaat taaacatttt caaaaggtgt 6780 aatttatttaaaattgtctc attttggtaa aatttatgtg aacttttaaa gctaaatatt 6840 aaacttaatatgctatgtaa atatatacat atatacattt aatgatgtat ttttttaaaa 6900 cattggcttgcttttgttaa agtgcaagtg ttacatatgg ctttgtacat taaagttgaa 6960 aggggttttacattttccat taaaaggact ttatc 6995 20 2055 PRT Homo sapiens 20 Met Asp LysThr His Thr Met Glu Leu Leu Pro Lys Glu Lys Phe Val 1 5 10 15 Ser ArgPro Pro Thr Pro Lys Cys Val Ile Asp Ile Thr Asn Asp Thr 20 25 30 Asn LeuGlu Lys Val Ala Gln Glu Asn Ser Ser Thr Phe Gly Leu Gln 35 40 45 Thr LeuGln Lys Met Asp Pro Asn Val Ser Asp Ser Lys His Ser Ile 50 55 60 Ala AsnAla Lys Phe Leu Glu Thr Ala Lys Lys Asp Ser Asp Gln Ser 65 70 75 80 TrpVal Ser Asp Val Val Lys Val Asp Leu Thr Gln Ser Ser Val Thr 85 90 95 AsnAla Ser Ser Gly Asn Asp His Leu Asn Met Glu Lys Glu Lys Tyr 100 105 110Val Ser Tyr Ile Ser Pro Leu Ser Ala Val Ser Val Met Glu Asp Lys 115 120125 Leu His Lys Arg Ser Pro Pro Pro Glu Thr Ile Lys Ser Lys Leu Asn 130135 140 Thr Ser Val Asp Thr His Lys Ile Lys Ser Ser Pro Ser Pro Glu Val145 150 155 160 Val Lys Pro Lys Ile Thr His Ser Pro Asp Ser Val Lys SerLys Ala 165 170 175 Thr Tyr Val Asn Ser Gln Ala Thr Gly Glu Arg Arg LeuAla Asn Lys 180 185 190 Ile Glu His Glu Leu Ser Arg Cys Ser Phe His ProIle Pro Thr Arg 195 200 205 Ser Ser Thr Leu Glu Thr Thr Lys Ser Pro LeuIle Ile Asp Lys Asn 210 215 220 Glu His Phe Thr Val Tyr Arg Asp Pro AlaLeu Ile Gly Ser Glu Thr 225 230 235 240 Gly Ala Asn His Ile Ser Pro PheLeu Ser Gln His Pro Phe Pro Leu 245 250 255 His Ser Ser Ser His Arg ThrCys Leu Asn Pro Gly Thr His His Pro 260 265 270 Ala Leu Thr Pro Ala ProHis Leu Leu Ala Gly Ser Ser Ser Gln Thr 275 280 285 Pro Leu Pro Thr IleAsn Thr His Pro Leu Thr Ser Gly Pro His His 290 295 300 Ala Val His HisPro His Leu Leu Pro Thr Val Leu Pro Gly Val Pro 305 310 315 320 Thr AlaSer Leu Leu Gly Gly His Pro Arg Leu Glu Ser Ala His Ala 325 330 335 SerSer Leu Ser His Leu Ala Leu Ala His Gln Gln Gln Gln Gln Leu 340 345 350Leu Gln His Gln Ser Pro His Leu Leu Gly Gln Ala His Pro Ser Ala 355 360365 Ser Tyr Asn Gln Leu Gly Leu Tyr Pro Ile Ile Trp Gln Tyr Pro Asn 370375 380 Gly Thr His Ala Tyr Ser Gly Leu Gly Leu Pro Ser Ser Lys Trp Val385 390 395 400 His Pro Glu Asn Ala Val Asn Ala Glu Ala Ser Leu Arg ArgAsn Ser 405 410 415 Pro Ser Pro Trp Leu His Gln Pro Thr Pro Val Thr SerAla Asp Gly 420 425 430 Ile Gly Leu Leu Ser His Ile Pro Val Arg Pro SerSer Ala Glu Pro 435 440 445 His Arg Pro Leu Lys Ile Thr Ala His Ser SerPro Pro Leu Thr Lys 450 455 460 Thr Leu Val Asp His His Lys Glu Glu LeuGlu Arg Lys Ala Phe Met 465 470 475 480 Glu Pro Leu Arg Ser Val Ala SerThr Ser Ala Lys Asn Asp Leu Asp 485 490 495 Leu Asn Arg Ser Gln Thr GlyLys Asp Cys His Leu His Arg His Phe 500 505 510 Val Asp Pro Val Leu AsnGln Leu Gln Arg Pro Pro Gln Glu Thr Gly 515 520 525 Glu Arg Leu Asn LysTyr Lys Glu Glu His Arg Arg Ile Leu Gln Glu 530 535 540 Ser Ile Asp ValAla Pro Phe Thr Thr Lys Ile Lys Gly Leu Glu Gly 545 550 555 560 Glu ArgGlu Asn Tyr Ser Arg Val Ala Ser Ser Ser Ser Ser Pro Lys 565 570 575 SerHis Ile Ile Lys Gln Asp Met Asp Val Glu Arg Ser Val Ser Asp 580 585 590Leu Tyr Lys Met Lys His Ser Val Pro Gln Ser Leu Pro Gln Ser Asn 595 600605 Tyr Phe Thr Thr Leu Ser Asn Ser Val Val Asn Glu Pro Pro Arg Ser 610615 620 Tyr Pro Ser Lys Glu Val Ser Asn Ile Tyr Gly Asp Lys Gln Ser Asn625 630 635 640 Ala Leu Ala Ala Ala Ala Ala Asn Pro Gln Thr Leu Thr SerPhe Ile 645 650 655 Thr Ser Leu Ser Lys Pro Pro Pro Leu Ile Lys His GlnPro Glu Ser 660 665 670 Glu Gly Leu Val Gly Lys Ile Pro Glu His Leu ProHis Gln Ile Ala 675 680 685 Ser His Ser Val Thr Thr Phe Arg Asn Asp CysArg Ser Pro Thr His 690 695 700 Leu Thr Val Ser Ser Thr Asn Thr Leu ArgSer Met Pro Ala Leu His 705 710 715 720 Arg Ala Pro Val Phe His Pro ProIle His His Ser Leu Glu Arg Lys 725 730 735 Glu Gly Ser Tyr Ser Ser LeuSer Pro Pro Thr Leu Thr Pro Val Met 740 745 750 Pro Val Asn Ala Gly GlyLys Val Gln Glu Ser Gln Lys Pro Pro Thr 755 760 765 Leu Ile Pro Glu ProLys Asp Ser Gln Ala Asn Phe Lys Ser Ser Ser 770 775 780 Glu Gln Ser LeuThr Glu Met Trp Arg Pro Asn Asn Asn Leu Ser Lys 785 790 795 800 Glu LysThr Glu Trp His Val Glu Lys Ser Ser Gly Lys Leu Gln Ala 805 810 815 AlaMet Ala Ser Val Ile Val Arg Pro Ser Ser Ser Thr Lys Thr Asp 820 825 830Ser Met Pro Ala Met Gln Leu Ala Ser Lys Asp Val Ser Glu Arg Ser 835 840845 Ser Ala Gly Ala His Lys Thr Asp Cys Leu Lys Leu Ala Glu Ala Gly 850855 860 Glu Thr Gly Arg Ile Ile Leu Pro Asn Val Asn Ser Asp Ser Val His865 870 875 880 Thr Lys Ser Glu Lys Asn Phe Gln Ala Val Ser Gln Gly SerVal Pro 885 890 895 Ser Ser Val Met Ser Ala Val Asn Thr Met Cys Asn ThrLys Thr Asp 900 905 910 Val Ile Thr Ser Ala Ala Asp Thr Thr Ser Val SerSer Trp Gly Gly 915 920 925 Ser Glu Val Ile Ser Ser Leu Ser Asn Thr IleLeu Ala Ser Thr Ser 930 935 940 Ser Glu Cys Val Ser Ser Lys Ser Val SerGln Pro Val Ala Gln Lys 945 950 955 960 Gln Glu Cys Lys Val Ser Thr ThrAla Pro Val Thr Leu Ala Ser Ser 965 970 975 Lys Thr Gly Ser Val Val GlnPro Ser Ser Gly Phe Ser Gly Thr Thr 980 985 990 Asp Phe Ile His Leu LysLys His Lys Ala Ala Leu Ala Ala Ala Gln 995 1000 1005 Tyr Lys Ser SerAsn Ala Ser Glu Thr Glu Pro Asn Ala Ile Lys Asn 1010 1015 1020 Gln ThrLeu Ser Ala Ser Leu Pro Leu Asp Ser Thr Val Ile Cys Ser 1025 1030 10351040 Thr Ile Asn Lys Ala Asn Ser Val Gly Asn Gly Gln Ala Ser Gln Thr1045 1050 1055 Ser Gln Pro Asn Tyr His Thr Lys Leu Lys Lys Ala Trp LeuThr Arg 1060 1065 1070 His Ser Glu Glu Asp Lys Asn Thr Asn Lys Met GluAsn Ser Gly Asn 1075 1080 1085 Ser Val Ser Glu Ile Ile Lys Pro Cys SerVal Asn Leu Ile Ala Ser 1090 1095 1100 Thr Ser Ser Asp Ile Gln Asn SerVal Asp Ser Lys Ile Ile Val Asp 1105 1110 1115 1120 Lys Tyr Val Lys AspAsp Lys Val Asn Arg Arg Lys Ala Lys Arg Thr 1125 1130 1135 Tyr Glu SerGly Ser Glu Ser Gly Asp Ser Asp Glu Ser Glu Ser Lys 1140 1145 1150 SerGlu Gln Arg Thr Lys Arg Gln Pro Lys Pro Thr Tyr Lys Lys Lys 1155 11601165 Gln Asn Asp Leu Gln Lys Arg Lys Gly Glu Ile Glu Glu Asp Leu Lys1170 1175 1180 Pro Asn Gly Val Leu Ser Arg Ser Ala Lys Glu Arg Ser LysLeu Lys 1185 1190 1195 1200 Leu Gln Ser Asn Ser Asn Thr Gly Ile Pro ArgSer Val Leu Lys Asp 1205 1210 1215 Trp Arg Lys Val Lys Lys Leu Lys GlnThr Gly Glu Ser Phe Leu Gln 1220 1225 1230 Asp Asp Ser Cys Cys Glu IleGly Pro Asn Leu Gln Lys Cys Arg Glu 1235 1240 1245 Cys Arg Leu Ile ArgSer Lys Lys Gly Glu Glu Pro Ala His Ser Pro 1250 1255 1260 Val Phe CysArg Phe Tyr Tyr Phe Arg Arg Leu Ser Phe Ser Lys Asn 1265 1270 1275 1280Gly Val Val Arg Ile Asp Gly Phe Ser Ser Pro Asp Gln Tyr Asp Asp 12851290 1295 Glu Ala Met Ser Leu Trp Thr His Glu Asn Phe Glu Asp Asp GluLeu 1300 1305 1310 Asp Ile Glu Thr Ser Lys Tyr Ile Leu Asp Ile Ile GlyAsp Lys Phe 1315 1320 1325 Cys Gln Leu Val Thr Ser Glu Lys Thr Ala LeuSer Trp Val Lys Lys 1330 1335 1340 Asp Ala Lys Ile Ala Trp Lys Arg AlaVal Arg Gly Val Arg Glu Met 1345 1350 1355 1360 Cys Asp Ala Cys Glu AlaThr Leu Phe Asn Ile His Trp Val Cys Gln 1365 1370 1375 Lys Cys Gly PheVal Val Cys Leu Asp Cys Tyr Lys Ala Lys Glu Arg 1380 1385 1390 Lys SerSer Arg Asp Lys Glu Leu Tyr Ala Trp Met Lys Cys Val Lys 1395 1400 1405Gly Gln Pro His Asp His Lys His Leu Met Pro Thr Gln Ile Ile Pro 14101415 1420 Gly Ser Val Leu Thr Asp Leu Leu Asp Ala Met His Thr Leu ArgGlu 1425 1430 1435 1440 Lys Tyr Gly Ile Lys Ser His Cys His Cys Thr AsnLys Gln Asn Leu 1445 1450 1455 Gln Val Gly Asn Phe Pro Thr Met Asn GlyVal Ser Gln Val Leu Gln 1460 1465 1470 Asn Val Leu Asn His Ser Asn LysIle Ser Leu Cys Met Pro Glu Ser 1475 1480 1485 Gln Gln Gln Asn Thr ProPro Lys Ser Glu Lys Asn Gly Gly Ser Ser 1490 1495 1500 Pro Glu Ser AspVal Gly Thr Asp Asn Lys Leu Thr Pro Pro Glu Ser 1505 1510 1515 1520 GlnSer Pro Leu His Trp Leu Ala Asp Leu Ala Glu Gln Lys Ala Arg 1525 15301535 Glu Glu Lys Lys Glu Asn Lys Glu Leu Thr Leu Glu Asn Gln Ile Lys1540 1545 1550 Glu Glu Arg Glu Gln Asp Asn Ser Glu Ser Pro Asn Gly ArgThr Ser 1555 1560 1565 Pro Leu Val Ser Gln Asn Asn Glu Gln Gly Ser ThrLeu Arg Asp Leu 1570 1575 1580 Leu Thr Thr Thr Ala Gly Lys Leu Arg ValGly Ser Thr Asp Ala Gly 1585 1590 1595 1600 Ile Ala Phe Ala Pro Val TyrSer Met Gly Ala Pro Ser Ser Lys Ser 1605 1610 1615 Gly Arg Thr Met ProAsn Ile Leu Asp Asp Ile Ile Ala Ser Val Val 1620 1625 1630 Glu Asn LysIle Pro Pro Ser Lys Thr Ser Lys Ile Asn Val Lys Pro 1635 1640 1645 GluLeu Lys Glu Glu Pro Glu Glu Ser Ile Ile Ser Ala Val Asp Glu 1650 16551660 Asn Asn Lys Leu Tyr Ser Asp Ile Pro His Ser Trp Ile Cys Glu Lys1665 1670 1675 1680 His Ile Leu Trp Leu Lys Asp Tyr Lys Asn Ser Ser AsnTrp Lys Leu 1685 1690 1695 Phe Lys Glu Cys Trp Lys Gln Gly Gln Pro AlaVal Val Ser Gly Val 1700 1705 1710 His Lys Lys Met Asn Ile Ser Leu TrpLys Ala Glu Ser Ile Ser Leu 1715 1720 1725 Asp Phe Gly Asp His Gln AlaAsp Leu Leu Asn Cys Lys Asp Ser Ile 1730 1735 1740 Ile Ser Asn Ala AsnVal Lys Glu Phe Trp Asp Gly Phe Glu Glu Val 1745 1750 1755 1760 Ser LysArg Gln Lys Asn Lys Ser Gly Glu Thr Val Val Leu Lys Leu 1765 1770 1775Lys Asp Trp Pro Ser Gly Glu Asp Phe Lys Thr Met Met Pro Ala Arg 17801785 1790 Tyr Glu Asp Leu Leu Lys Ser Leu Pro Leu Pro Glu Tyr Cys AsnPro 1795 1800 1805 Glu Gly Lys Phe Asn Leu Ala Ser His Leu Pro Gly PhePhe Val Arg 1810 1815 1820 Pro Asp Leu Gly Pro Arg Leu Cys Ser Ala TyrGly Val Val Ala Ala 1825 1830 1835 1840 Lys Asp His Asp Ile Gly Thr ThrAsn Leu His Ile Glu Val Ser Asp 1845 1850 1855 Val Val Asn Ile Leu ValTyr Val Gly Ile Ala Lys Gly Asn Gly Ile 1860 1865 1870 Leu Ser Lys AlaGly Ile Leu Lys Lys Phe Glu Glu Glu Asp Leu Asp 1875 1880 1885 Asp IleLeu Arg Lys Arg Leu Lys Asp Ser Ser Glu Ile Pro Gly Ala 1890 1895 1900Leu Trp His Ile Tyr Ala Gly Lys Asp Val Asp Lys Ile Arg Glu Phe 19051910 1915 1920 Leu Gln Lys Ile Ser Lys Glu Gln Gly Leu Glu Val Leu ProGlu His 1925 1930 1935 Asp Pro Ile Arg Asp Gln Ser Trp Tyr Val Asn LysLys Leu Arg Gln 1940 1945 1950 Arg Leu Leu Glu Glu Tyr Gly Val Arg ThrCys Thr Leu Ile Gln Phe 1955 1960 1965 Leu Gly Asp Ala Ile Val Leu ProAla Gly Ala Leu His Gln Val Gln 1970 1975 1980 Asn Phe His Ser Cys IleGln Val Thr Glu Asp Phe Val Ser Pro Glu 1985 1990 1995 2000 His Leu ValGlu Ser Phe His Leu Thr Gln Glu Leu Arg Leu Leu Lys 2005 2010 2015 GluGlu Ile Asn Tyr Asp Asp Lys Leu Gln Val Lys Asn Ile Leu Tyr 2020 20252030 His Ala Val Lys Glu Met Val Arg Ala Leu Lys Ile His Glu Asp Glu2035 2040 2045 Val Glu Asp Met Glu Glu Asn 2050 2055

We claim:
 1. A nucleic acid consisting of a) a nucleic acid that codes apolypeptide that includes one of amino acid sequences represented inSeq. ID Nos. 14 to 18 and 20, b) a nucleic acid that includes one ofnucleotide sequences represented in Seq. ID Nos. 1 to 13 and 19, c) anucleic acid having a hybridized nucleotide sequence hybridized with atleast one of said nucleotide sequences represented in said Seq. ID Nos.1 to 13 and 19 and said nucleotide sequences of said nucleic acid codingsaid polypeptide that includes one of said amino acid sequencesrepresented in said Seq. ID Nos. 14 to 18 and 20 under stringentconditions and wherein said hybridized nucleotide sequence codes for apolypeptide with biological activity of a co-modulator, or d) a nucleicacid having a degenerate nucleotide sequence corresponding to one ofsaid nucleotide sequences represented in said Seq. ID Nos. 1 to 13 and19, said nucleotide sequence of said nucleic acid coding saidpolypeptide that includes one of said amino acid sequences representedin said Seq. ID Nos. 14 to 18 and 20 and said hybridized nucleotidesequence within the framework of degeneration of the genetic code.
 2. Apolypeptide coded by a nucleic acid, said nucleic acid consisting of a)a nucleic acid that codes a polypeptide that includes one of amino acidsequences represented in Seq. ID Nos. 14 to 18 and 20, b) a nucleic acidthat includes one of nucleotide sequences represented in Seq. ID Nos. 1to 13 and 19, c) a nucleic acid having a hybridized nucleotide sequencehybridized with at least one of said nucleotide sequences represented insaid Seq. ID Nos. 1 to 13 and 19 and said nucleotide sequences of saidnucleic acid coding said polypeptide that includes one of said aminoacid sequences represented in said Seq. ID Nos. 14 to 18 and 20 understringent conditions and wherein said hybridized nucleotide sequencecodes for a polypeptide with biological activity of a co-modulator, ord) a nucleic acid having a degenerate nucleotide sequence correspondingto one of said nucleotide sequences represented in said Seq. ID Nos. 1to 13 and 19, said nucleotide sequence of said nucleic acid coding saidpolypeptide that includes one of said amino acid sequences representedin said Seq. ID Nos. 14 to 18 and 20 and said hybridized nucleotidesequence within the framework of degeneration of the genetic code.
 3. Apolypeptide comprising an amino acid sequence represented in one of Seq.ID Nos. 14 to 18 and
 20. 4. A vector containing at least one copy of anucleic acid, said nucleic acid consisting of a) a nucleic acid thatcodes a polypeptide that includes one of amino acid sequencesrepresented in Seq. ID Nos. 14 to 18 and 20, b) a nucleic acid thatincludes one of nucleotide sequences represented in Seq. ID Nos. 1 to 13and 19, c) a nucleic acid having a hybridized nucleotide sequencehybridized with at least one of said nucleotide sequences represented insaid Seq. ID Nos. 1 to 13 and 19 and said nucleotide sequences of saidnucleic acid coding said polypeptide that includes one of said aminoacid sequences represented in said Seq. ID Nos. 14 to 18 and 20 understringent conditions and wherein said hybridized nucleotide sequencecodes for a polypeptide with biological activity of a co-modulator, ord) a nucleic acid having a degenerate nucleotide sequence correspondingto one of said nucleotide sequences represented in said Seq. ID Nos. 1to 13 and 19, said nucleotide sequence of said nucleic acid coding saidpolypeptide that includes one of said amino acid sequences representedin said Seq. ID Nos. 14 to 18 and 20 and said hybridized nucleotidesequence within the framework of degeneration of the genetic code.
 5. Acell transfected with a nucleic acid or transfected with a vectorcontaining at least one copy of said nucleic acid, wherein said nucleicacid consists of a) a nucleic acid that codes a polypeptide thatincludes one of amino acid sequences represented in Seq. ID Nos. 14 to18 and 20, b) a nucleic acid that includes one of nucleotide sequencesrepresented in Seq. ID Nos. 1 to 13 and 19, c) a nucleic acid having ahybridized nucleotide sequence hybridized with at least one of saidnucleotide sequences represented in said Seq. ID Nos. 1 to 13 and 19 andsaid nucleotide sequences of said nucleic acid coding said polypeptidethat includes one of said amino acid sequences represented in said Seq.ID Nos. 14 to 18 and 20 under stringent conditions and wherein saidhybridized nucleotide sequence codes for a polypeptide with biologicalactivity of a co-modulator, or d) a nucleic acid having a degeneratenucleotide sequence corresponding to one of said nucleotide sequencesrepresented in said Seq. ID Nos. 1 to 13 and 19, said nucleotidesequence of said nucleic acid coding said polypeptide that includes oneof said amino acid sequences represented in said Seq. ID Nos. 14 to 18and 20 and said hybridized nucleotide sequence within the framework ofdegeneration of the genetic code.
 6. A method of expressing a nucleicacid consisting of a) a nucleic acid that codes a polypeptide thatincludes one of amino acid sequences represented in Seq. ID Nos. 14 to18 and 20, b) a nucleic acid that includes one of nucleotide sequencesrepresented in Seq. ID Nos. 1 to 13 and 19, c) a nucleic acid having ahybridized nucleotide sequence hybridized with at least one of saidnucleotide sequences represented in said Seq. ID Nos. 1 to 13 and 19 andsaid nucleotide sequences of said nucleic acid coding said polypeptidethat includes one of said amino acid sequences represented in said Seq.ID Nos. 14 to 18 and 20 under stringent conditions and wherein saidhybridized nucleotide sequence codes for a polypeptide with biologicalactivity of a co-modulator, or d) a nucleic acid having a degeneratenucleotide sequence corresponding to one of said nucleotide sequencesrepresented in said Seq. ID Nos. 1 to 13 and 19, said nucleotidesequence of said nucleic acid coding said polypeptide that includes oneof said amino acid sequences represented in said Seq. ID Nos. 14 to 18and 20 and said hybridized nucleotide sequence within the framework ofdegeneration of the genetic code; wherein said method comprises using acell for the expressing.
 7. A method of producing an antibody comprisingproducing a polypeptide or a part of said polypeptide, wherein saidpolypeptide or said part is coded by a nucleic acid, said nucleic acidconsisting of a) a nucleic acid that codes a polypeptide that includesone of amino acid sequences represented in Seq. ID Nos. 14 to 18 and 20,b) a nucleic acid that includes one of nucleotide sequences representedin Seq. ID Nos. 1 to 13 and 19, c) a nucleic acid having a hybridizednucleotide sequence hybridized with at least one of said nucleotidesequences represented in said Seq. ID Nos. 1 to 13 and 19 and saidnucleotide sequences of said nucleic acid coding said polypeptide thatincludes one of said amino acid sequences represented in said Seq. IDNos. 14 to 18 and 20 under stringent conditions and wherein saidhybridized nucleotide sequence codes for a polypeptide with biologicalactivity of a co-modulator, or d) a nucleic acid having a degeneratenucleotide sequence corresponding to one of said nucleotide sequencesrepresented in said Seq. ID Nos. 1 to 13 and 19, said nucleotidesequence of said nucleic acid coding said polypeptide that includes oneof said amino acid sequences represented in said Seq. ID Nos. 14 to 18and 20 and said hybridized nucleotide sequence within the framework ofdegeneration of the genetic code.
 8. A method of producing an antibodycomprising producing a polypeptide or a part of said polypeptide,wherein said polypeptide or said part of said polypeptide has an aminoacid sequence represented in one of Seq. ID Nos. 14 to 18 and
 20. 9. Anantibody against a polypeptide or a part of said polypeptide, whereinsaid antibody is made by a method comprising producing a polypeptide ora part of said polypeptide, wherein said polypeptide or said part iscoded by a nucleic acid, said nucleic acid consisting of a) a nucleicacid that codes a polypeptide that includes one of amino acid sequencesrepresented in Seq. ID Nos. 14 to 18 and 20, b) a nucleic acid thatincludes one of nucleotide sequences represented in Seq. ID Nos. 1 to 13and 19, c) a nucleic acid having a hybridized nucleotide sequencehybridized with at least one of said nucleotide sequences represented insaid Seq. ID Nos. 1 to 13 and 19 and said nucleotide sequences of saidnucleic acid coding said polypeptide that includes one of said aminoacid sequences represented in said Seq. ID Nos. 14 to 18 and 20 understringent conditions and wherein said hybridized nucleotide sequencecodes for a polypeptide with biological activity of a co-modulator, ord) a nucleic acid having a degenerated nucleotide sequence correspondingto one of said nucleotide sequences represented in said Seq. ID Nos. 1to 13 and 19, said nucleotide sequence of said nucleic acid coding saidpolypeptide that includes one of said amino acid sequences representedin said Seq. ID No. 14 to 18 and 20 and said hybridized nucleotidesequence within the framework of degeneration of the genetic code. 10.An antibody against a polypeptide or a part of said polypeptide, whereinsaid antibody is made by a method comprising producing a polypeptide ora part of said polypeptide, wherein said polypeptide or said part has anamino acid sequence represented in one of Seq. ID Nos. 14 to 18 and 20.11. A method of detecting a polypeptide or a part of a polypeptidecomprising detecting said polypeptide or said part of said polypeptidewith an antibody against said polypeptide or said part of saidpolypeptide, wherein said polypeptide or said part is coded by a nucleicacid, said nucleic acid consisting of a) a nucleic acid that codes apolypeptide that includes one of amino acid sequences represented inSeq. ID Nos. 14 to 18 and 20, b) a nucleic acid that includes one ofnucleotide sequences represented in Seq. ID Nos. 1 to 13 and 19, c) anucleic acid having a hybridized nucleotide sequence hybridized with atleast one of said nucleotide sequences represented in said Seq. ID Nos.1 to 13 and 19 and said nucleotide sequences of said nucleic acid codingsaid polypeptide that includes one of said amino acid sequencesrepresented in said Seq. ID Nos. 14 to 18 and 20 under stringentconditions and wherein said hybridized nucleotide sequence codes for apolypeptide with biological activity of a co-modulator, or d) a nucleicacid having a degenerate nucleotide sequence corresponding to one ofsaid nucleotide sequences represented in said Seq. ID Nos. 1 to 13 and19, said nucleotide sequence of said nucleic acid coding saidpolypeptide that includes one of said amino acid sequences representedin said Seq. ID No. 14 to 18 and 20 and said hybridized nucleotidesequence within the framework of degeneration of the genetic code.
 12. Amethod of detecting a polypeptide or a part of a polypeptide comprisingdetecting said polypeptide or said part of said polypeptide with anantibody against said polypeptide or said part of said polypeptide,wherein said polypeptide or said part has an amino acid sequencerepresented in one of Seq. ID Nos. 14 to 18 and
 20. 13. A method ofpreparation of a reagent for detection of mRNA comprising testing forthe mRNA with a probe having a nucleic acid sequence complementary to amember selected from the group consisting of nucleotide sequencesrepresented in Seq. ID Nos. 1 to 13 and 19, nucleotide sequences thatcode polypeptides that have amino acid sequences represented in Seq. IDNos. 14 to 18 and 20, hybridized nucleotide sequences hybridized withsaid nucleotide sequences represented in said Seq. ID Nos. 1 to 13 and19 and with said nucleotide sequences that code said polypeptides understringent conditions and degenerate nucleotide sequences correspondingto one of the foregoing said nucleotide sequences within the frameworkof genetic code degeneration.
 14. A method of preparation of a reagentfor detection of mRNA comprising testing for the mRNA with a probehaving nucleic acid sequences complementary to nucleotide sequences thatcode for polypeptides, or parts of polypeptides having amino acidsequences represented in Seq. ID Nos. 14 to 18 and
 20. 15. A method ofproducing an effective agent for treating steroid hormone-dependentdiseases, said method comprising using a nucleic acid as a targetsubstance for producing said agent and wherein said nucleic acidconsists of a) a nucleic acid that codes a polypeptide that includes oneof amino acid sequences represented in Seq. ID Nos. 14 to 18 and 20, b)a nucleic acid that includes one of nucleotide sequences represented inSeq. ID Nos. 1 to 13 and 19, c) a nucleic acid having a hybridizednucleotide sequence hybridized with at least one of said nucleotidesequences represented in said Seq. ID Nos. 1 to 13 and 19 and saidnucleotide sequences of said nucleic acid coding said polypeptide thatincludes one of said amino acid sequences represented in said Seq. IDNos. 14 to 18 and 20 under stringent conditions and wherein saidhybridized nucleotide sequence codes for a polypeptide with biologicalactivity of a co-modulator, or d) a nucleic acid having a degeneratenucleotide sequence corresponding to one of said nucleotide sequencesrepresented in said Seq. ID Nos. 1 to 13 and 19, said nucleotidesequence of said nucleic acid coding said polypeptide that includes oneof said amino acid sequences represented in said Seq. ID Nos. 14 to 18and 20 and said hybridized nucleotide sequence within the framework ofdegeneration of the genetic code.
 16. A method of producing an effectiveagent for treating steroid hormone-dependent diseases, said methodcomprising using a polypeptide a target substance for producing saidagent and wherein said polypeptide or a part of said polypeptide iscoded by a nucleic acid, said nucleic acid consisting of a) a nucleicacid that codes a polypeptide that includes one of amino acid sequencesrepresented in Seq. ID Nos. 14 to 18 and 20, b) a nucleic acid thatincludes one of nucleotide sequences represented in Seq. ID Nos. 1 to 13and 19, c) a nucleic acid having a hybridized nucleotide sequencehybridized with at least one of said nucleotide sequences represented insaid Seq. ID Nos. 1 to 13 and 19 and said nucleotide sequences of saidnucleic acid coding said polypeptide that includes one of said aminoacid sequences represented in said Seq. ID Nos. 14 to 18 and 20 understringent conditions and wherein said hybridized nucleotide sequencecodes for a polypeptide with biological activity of a co-modulator, ord) a nucleic acid having a degenerate nucleotide sequence correspondingto one of said nucleotide sequences represented in said Seq. ID Nos. 1to 13 and 19, said nucleotide sequence of said nucleic acid coding saidpolypeptide that includes one of said amino acid sequences representedin said Seq. ID Nos. 14 to 18 and 20 and said hybridized nucleotidesequence within the framework of degeneration of the genetic code.
 17. Amethod of identifying an effector of a polypeptide, said polypeptideincluding an amino acid sequence, which is represented in one of Seq. IDNos. 14 to 18 and 20 or which is coded by a nucleic acid that codes saidpolypeptide, a nucleic acid that includes one of nucleotide sequencesrepresented in Seq. ID Nos. 1 to 13 and 19, a nucleic acid having ahybridized nucleotide sequence hybridized with at least one of saidnucleotide sequences represented in said Seq. ID Nos. 1 to 13 and 19 andwith said nucleotide sequences of said nucleic acid coding saidpolypeptide under stringent conditions, said hybridized nucleotidesequence coding for a polypeptide with biological activity of aco-modulator, or a nucleic acid having a degenerate nucleotide sequencecorresponding to one of said nucleotide sequences represented in saidSeq. ID Nos. 1 to 13 and 19, said nucleotide sequences of said nucleicacid coding said polypeptide and said hybridized nucleotide sequence,within the framework of degeneration of the genetic code. wherein saidmethod comprises the step of using said nucleic acid, a cell transfectedwith said nucleic acid, a cell transfected with a vector containing atleast one copy of said nucleic acid, said polypeptide including theamino acid sequence represented in one of said Seq. ID Nos. 14 to 18 and20 or said polypeptide coded by said nucleic acid.
 18. A testing systemfor identifying effectors of a polypeptide, said polypeptide includingan amino acid sequence, which is represented in one of Seq. ID Nos. 14to 18 and 20 or which is coded by a nucleic acid consisting of a nucleicacid that codes said polypeptide, a nucleic acid that includes one ofnucleotide sequences represented in Seq. ID Nos. 1 to 13 and 19, anucleic acid having a hybridized nucleotide sequence hybridized with oneof said nucleotide sequences represented in said Seq. ID Nos. 1 to 13and 19 and said nucleotide sequences of said nucleic acid coding saidpolypeptide under stringent conditions, said hybridized nucleotidesequence coding for a polypeptide with biological activity of aco-modulator, or a nucleic acid having a degenerated nucleotide sequencecorresponding to one of said nucleotide sequences represented in saidSeq. ID Nos. 1 to 13 and 19, said nucleotide sequences of said nucleicacid coding said polypeptide and said hybridized nucleotide sequence,within the framework of degeneration of the genetic code; wherein a) areporter gene is expressed in a cell transfected with said nucleic acidor in a cell transfected with a vector containing at least one copy ofsaid nucleic acid; b) said cell, if said cell contains no nuclearreceptor or only a small amount thereof, is also transfected with avector containing DNA of the nuclear receptor; c) said cell is culturedin the presence or absence of a plurality of test substances; and d) achange in expression of said reporter gene is measured to determinewhich of said test substances is or are identified as an effector oreffectors.
 19. The testing system as defined in claim 18, wherein thenuclear receptor is an androgen receptor.
 20. The testing system asdefined in claim 18, wherein the cell is cultured in the presence orabsence of the test substances and in the simultaneous presence of aligand of the nuclear receptor.
 21. The testing system as defined inclaim 20, wherein the nuclear receptor is an androgen receptor and theligand is an androgen.
 22. A method for preparing a pharmaceuticalagent, wherein a) providing a testing system for identifying effectorsof a polypeptide, said polypeptide including an amino acid sequence,which is represented in one of Seq. ID Nos. 14 to 18 and 20 or which iscoded by a nucleic acid consisting of a nucleic acid that codes saidpolypeptide, a nucleic acid that includes one of nucleotide sequencesrepresented in Seq. ID Nos. 1 to 13 and 19, a nucleic acid having ahybridized nucleotide sequence hybridized with at least one of saidnucleotide sequences represented in said Seq. ID Nos. 1 to 13 and 19 andsaid nucleotide sequences of said nucleic acid coding said polypeptideunder stringent conditions, said hybridized nucleotide sequence codingfor a polypeptide with biological activity of a co-modulator, or anucleic acid having a degenerate nucleotide sequence corresponding toone of said nucleotide sequences represented in said Seq. ID Nos. 1 to13 and 19, said nucleotide sequences of said nucleic acid coding saidpolypeptide and said hybridized nucleotide sequence, within theframework of degeneration of the genetic code; wherein a reporter geneis expressed in a cell transfected with said nucleic acid or in a celltransfected with a vector containing at least one copy of said nucleicacid; said cell, if said cell contains no nuclear receptor or only asmall amount thereof, is also transfected with a vector containing DNAof the nuclear receptor; said cell is cultured in the presence orabsence of a plurality of test materials; and a change in expression ofsaid reporter gene is measured to determine which of said test materialsis or are identified as an effector or effectors; b) substances arebrought in contact with said testing system, c) the action of thesubstances on the testing system is measured by comparison withcontrols, d) one of the substances measured in step c) and showingmodulation of activity of said polypeptide is identified, and e) the oneof the substances identified in step d) is mixed with formulationmaterials commonly used in pharmaceuticals to form said pharmaceuticalagent.